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Abstract. In a probabilistic cellular automaton in which all local transitions have positive prob- 
ability, the problem of keeping a bit of information indefinitely is nontrivial, even in an infinite 
automaton. Still, there is a solution in 2 dimensions, and this solution can be used to construct 
a simple 3-dimensional discrete-time universal fault-tolerant cellular automaton. This technique 
does not help much to solve the following problems: remembering a bit of information in 1 dimen- 
sion; computing in dimensions lower than 3; computing in any dimension with non-synchronized 
transitions. 

Our more complex technique organizes the cells in blocks that perform a reliable simulation of 
a second (generalized) cellular automaton. The cells of the latter automaton are also organized 
in blocks, simulating even more reliably a third automaton, etc. Since all this (a possibly infinite 
hierarchy) is organized in "software", it must be under repair all the time from damage caused 
by errors. A large part of the problem is essentially self-stabilization recovering from a mess of 
arbitrary size and content. The present paper constructs an asynchronous one-dimensional fault- 
tolerant cellular automaton, with the further feature of "self-organization" . The latter means that 
unless a large amount of input information must be given, the initial configuration can be chosen 
homogeneous. 
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1. Introduction 

A cellular automaton is a homogenous array of identical, locally communicating finite-state au- 
tomata. The model is also called interacting particle system. Fault-tolerant computation and infor- 
mation storage in cellular automata is a natural and challenging mathematical problem but there 
are also some arguments indicating an eventual practical significance of the subject since there are 
advantages in uniform structure for parallel computers. 

Fault-tolerant cellular automata (FCA) belong to the larger category of reliable computing devices 
built from unreliable components, in which the error probability of the individual components is not 
required to decrease as the size of the device increases. In such a model it is essential that the faults 
are assumed to be transient: they change the local state but not the local transition function. 

A fault-tolerant computer of this kind must use massive parallelism. Indeed, information stored 
anywhere during computation is subject to decay and therefore must be actively maintained. It does 
not help to run two computers simultaneously, comparing their results periodically since faults will 
occur in both of them between comparisons with high probability. The self-correction mechanism 
must be built into each part of the computer. In cellular automata, it must be a property of the 
transition function of the cells. 

Due to the homogeneity of cellular automata, since large groups of errors can destroy large parts 
of any kind of structure, "self-stabilization" techniques are needed in conjunction with traditional 
error-correction. 



1.1. Historical remarks. The problem of reliable computation with unreliable components was 
addressed i n [p9| in the context of Boolean circuits. Von Neumann's solution, as well as its improved 
versions in ||9| and p^ , rely on high connectivity and non- uniform constructs. The best currently 
known result of this type is in where redundancy has been substantially decreased for the case 
of computations whose computing time is larger than the storage requirement. 

Of particular interest to us are those probabilistic cellular automata in which all local transition 
probabilities are positive (let us call such automata noisy), since such an automaton is obtained by 
way of "perturbation" from a deterministic cellular automaton. The automaton may have e.g. two 
distinguished initial configurations: say in which all cells have state and in which all have 
state 1 (there may be other states besides and 1). Let pi{x,t) be the probability that, starting 
from initial configuration ^i, the state of cell x at time t is i. If pi{x,t) is bigger than, say, 2/3 for 
all X, t then we can say that the automaton remembers the initial configuration forever. 

Informally speaking, a probabilistic cellular automaton is called mixing if it eventually forgets all 
information about its initial configuration. Finite noisy cellular automata are always mixing. In the 
example above, one can define the "relaxation time" as the time by which the probability decreases 
below 2/3. If an infinite automaton is mixing then the relaxation time of the corresponding finite 
automaton is bounded independently of size. A minimal requirement of fault-tolerance is therefore 
that the infinite automaton be non-mixing. 

The difficulty in constructing non-mixing noisy one-dimensional cellular automata is that even- 
tually large blocks of errors which we might call islands will randomly occur. We can try to design 
a transition function that (except for a small error probability) attempts to decrease these islands. 
It is a natural idea that the function should replace the state of each cell, at each transition time, 
with the majority of the cell states in some neighborhood. However, majority voting among the five 
nearest neighbors (including the cell itself) seems to lead to a mixing transition function, even in 
two dimensions, if the "failure" probabilities are not symmetric with respect to the interchange of 
O's and I's, and has not been proved to be non- mixing even in the symmetric case. Perturbations 
of the one-dimensional majority voting function were actually shown to be mixing in [p^ and jlTt. 



RELIABLE CELLULAR AUTOMATA 



5 



Non-mixing noisy cellular automata for dimensions 2 and higher were constructed in [ p7[ . These 
automata are also non-ergodic: an apparently stronger property (see formal definition later). All our 
examples of non- mixing automata will also be non-ergodic. The paper [l^ applies Toom's work 
to design a simple three-dimensional fault-tolerant cellular automaton that simulates arbitrary one- 
dimensional arrays. The original proof was simplified and adapted to strengthen these results in j^]. 

Remark 1.1. A three-dimensional fault-tolerant cellular automaton cannot be built to arbitrary size 
in the physical space. Indeed, there will be an (inherently irreversible) error-correcting operation on 
the average in every constant number of steps in each cell. This will produce a steady flow of heat 
from each cell that needs therefore a separate escape route for each cell. () 

A simple one-dimensional deterministic cellular automaton eliminating finite islands in the ab- 
sence of failures was defined in (see also ||]). It is now known (see |2^) that perturbation (at 
least, in a strongly biased way) makes this automaton mixing. 

1.2. Hierarchical constructions. The limited geometrical possibilities in one dimension suggest 
that only some non-local organization can cope with the task of eliminating finite islands. Indeed, 
imagine a large island of I's in the 1-dimensional ocean of O's. Without additional information, 
cells at the left end of this island will not be able to decide locally whether to move the boundary 
to the right or to the left. This information must come from some global organization that, given 
the fixed size of the cells, is expected to be hierarchical. The "cellular automaton" in |^ gives 
such a hierarchical organization. It indeed can hold a bit of information indefinitely. However, the 
transition function is not uniform either in space or time: the hierarchy is "hardwired" into the way 
the transition function changes. 

The paper |lO[ constructs a non-ergodic one-dimensional cellular automaton working in discrete 
time, using some ideas from the very informal paper of Georgii Kurdyumov. Surprisingly, it 
seems even today that in one dimension, the keeping of a bit of information requires all the organi- 
zation needed for general fault-tolerant computation. The paper |jll| constructs a two-dimensional 
fault-tolerant cellular automaton. In the two-dimensional work, the space requirement of the reliable 
implementation of a computation is only a constant times greater than that of the original version. 
(The time requirement increases by a logarithmic factor.) 

In both papers, the cells are organized in blocks that perform a fault-tolerant simulation of 
a second, generalized cellular automaton. The cells of the latter automaton are also organized 
in blocks, simulating even more reliably a third generalized automaton, etc. In all these papers 
(including the present one), since all this organization is in "software", i.e. it is encoded into the 
states of the cells, it must be under repair all the time from breakdown caused by errors. In the 
two-dimensional case, Toom's transition function simplifies the repairs. 

1.2.1. Asynchrony. In the three-dimensional fault-tolerant cellular automaton of [p^ , the compo- 
nents must work in discrete time and switch simultaneously to their next state. This requirement is 
unrealistic for arbitrarily large arrays. A more natural model for asynchronous probabilistic cellular 
automata is that of a continuous-time Markov process. This is a much stronger assumption than 
allowing an adversary scheduler but it still leaves a lot of technical problems to be solved. Informally 
it allows cells to choose whether to update at the present time independently of the choice their 
neighbors make. 

The paper ^ gives a simple method to implement arbitrary computations on asynchronous ma- 
chines with otherwise perfectly reliable components. A two-dimensional asynchronous fault-tolerant 
cellular automaton was constructed in ]30[ |. Experiments combining this technique with the error- 
correction mechanism of jl^ were made, among others, in 



6 



PETER GACS 



The present paper constructs a one-dimensional asynchronous fault-tolerant cellular automaton, 
thus completing the refutation of the so-called Positive Rates Conjecture in [ pO[ . 

1.2.2. Self-organization. Most hierarchical constructions, including ours, start from a complex, hi- 
erarchical initial configuration (in case of an infinite system, and infinite hierarchy). The present 
paper presents some results which avoid this. E.g., when the computation's goal is to remember a 
constant amount of information, (as in the refutation of the positive rates conjecture) then we will 
give a transition function that performs this task even if each cell of the initial configuration has 
the same state. We call this "self-organization" since the hierarchical organization will still emerge 
during the computation. 

1.2.3. Proof method simplification. Several methods have emerged that help managing the complex- 
ity of a large construction but the following two are the most important. 

• A number of "interface" concepts is introduced (generalized simulation, generalized cellular 
automaton) helping to separate the levels of the infinite hierarchy, and making it possible to 
speak meaningfully of a single pair of adjacent levels. 

• Though the construction is large, its problems are presented one at a time. E.g. the messiest 
part of the self-stabilization is the so-called Attribution Lemma, showing how after a while all 
cells can be attributed to some large organized group (colony), and thus no debris is in the 
way of the creation of new colonies. This lemma relies mainly on the Purge and Decay rules, 
and will be proved before introducing many other major rules. Other parts of the construction 
that are not possible to ignore are used only through "interface conditions" (specifications). 

We believe that the new result and the new method of presentation will serve as a firm basis 
for other new results. An example of a problem likely to yield to the new framework is the growth 
rate of the relaxation time as a function of the size of a finite cellular automaton. At present, the 
relaxation time of all known cellular automata either seems to be bounded (ergodic case) or grows 
exponentially. We believe that our constructions will yield examples for other, intermediate growth 
rates. 

1.2.4. Overview of the paper. 

• Sections ^, ^ |4| are an informal discussion of the main ideas of the construction, along with 
some formal definitions, e.g. 

— block codes, colonies, hierarchical codes; 

— abstract media, which are a generalization of cellular automata; 

— simulations] 

— amplifiers: a sequence of media with simulations between each and the next one; 

• Section ^ formulates the main theorems for discrete time. 

It also explains the main technical problems of the construction and the ways to solve them: 

— correction of structural damage by destruction followed by rebuilding from the neighbors; 

— a "hard-wired" program; 

— "legalization" of all locally consistent structures; 

• Section ^defines media, a specialization of abstract media with the needed stochastic structure. 
Along with media, we will define canonical simulations, whose form guarantees that they are 
simulations between media. We will give the basic examples of media with the basic simulations 
between them. 

The section also defines variable-period media and formulates the main theorems for con- 
tinuous time. 
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Section ^ develops some simple simulations, to be used either directly or as a paradigm. The 
example transition function defined here will correct any set of errors in which no two errors 
occur close to each other. 

We also develop the language used for defining our transition function in the rest of the 
paper. 

A class of media for which nontrivial fault-tolerant simulations exist will be defined in Section ^. 
In these media, called "robust media", cells are not necessarily adjacent to each other. The 
transition function can erase as well as create cells. 

There is a set of "bad" states. The set of space-time points where bad values occur is called 
the "damage" . The Restoration Property requires that at any point of a trajectory, damage 
occurs (or persists) only with small probability (e) . The Computation Property requires that 
the trajectory obey the transition function in the absence of damage. 

It is possible to tell in advance how the damage will be defined in a space-time configuration 
rj* of a medium M2 simulated by some space-time configuration of a medium Mi . Damage 
is said to occur at a certain point (x, t) of rj* if within a certain space-time rectangle in the 
past of {x,t), the damage of rj cannot be covered by a small rectangle of a certain size. This 
is saying, essentially, that damage occurs at least "twice" in 77. The Restoration Property for 
77 with e will then guarantee that the damage in rj* also satisfies a restoration property with 
with « e^. 

Section ^ introduces all notions for the formulation of the main lemma. First we define the kind 
of amplifiers to be built and a set of parameters called the amplifier frame. The main lemma, 
called the Amplifier Lemma, says that amplifiers exist for many different sorts of amplifier 
frame. The rest of the section applies the main lemma to the proof of the main theorems. 
Section |l^ defines self-organizing amplifiers, formulates the lemma about their existence and 
applies it to the proof of the existence of a self-organizing non-ergodic cellular automaton. 
Section |l^ gives an overview of an amplifier. As indicated above, the restoration property will 
be satisfied automatically. In order to satisfy the computation property, the general framework 
of the program will be similar to the outline in Section |^. However, besides the single-error 
fault-tolerance property achieved there, it will also have a self-stabilization property. This 
means that a short time after the occurrence of arbitrary damage, the configuration enables 
us to interpret it in terms of colonies. (In practice, pieces of incomplete colonies will eliminate 
themselves.) In the absence of damage, therefore, the colony structure will recover from the 
effects of earlier damage, i.e. predictability in the simulated configuration is restored. 
Section |l^ gives the rules for killing, creation and purge. We prove the basic lemmas about 
space-time paths connecting live cells. 

Section O defines the decay rule and shows that a large gap will eat up a whole colony. 
Section |lj proves the Attribution Lemma that traces back each non-germ cell to a full colony. 
This lemm a exp resses the "self-stabilization" property mentioned above. The proof starts with 
Subsection 14.1 showing that if a gap will not be healed promptly then it grows. 
Section |l^ proves the Healing Lemma, showing how the effect of a small amount of damage 
will be corrected. Due to the need to restore some local clock values consistently with the 
neighbors, the healing rule is rather elaborate. 

Section |6| introduces and uses the error-correcting computation rules not dependent on com- 
munication with neighbor colonies. 

Section ^ introduces and applies the communication rules needed to prove the Computation 
Property in simulation. These are rather elaborate, due to the need to communicate with not 
completely reliable neighbor colonies asynchronously. 

Section |l^ defines the rules for germs and shows that these make our amplifier self-organizing. 
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The above constructions will be carried out for the case when the cells work asynchronously (with 
variable time between switchings). This does not introduce any insurmountable difficulty but makes 
life harder at several steps: more care is needed in the updating and correction of the counter field of 
a cell, and in the communication between neighbor colonies. The analysis in the proof also becomes 
more involved. 

Acknowledgement. I am very thankful to Robert Solovay for reading parts of the paper and finding 
important errors. Larry Gray revealed his identity as a referee and gave generously of his time to 
detailed discussions: the paper became much more readable (yes!) as a result. 

2. Cellular automata 

In the introductory sections, we confine ourselves to one-dimensional infinite cellular automata. 

Notation. Let M be the set of real numbers, and the set of remainders modulo m. For m = oo, 
this is the set Z of integers. We introduce a non-standard notation for intervals on the real line. 
Closed intervals are denoted as before: [a,b] = {x : a < x < b}. But open and half-closed intervals 
are denoted as follows: 

[a+, b] = {x : a < X < b}, 

[a, b—] = {x : a < X < b}, 

[a+, b—] = {x : a < X < b}. 

The advantage of this notation is that the pair (x, y) will not be confused with the open interval 
traditionally denoted {x, y) and that the text editor program will not complain about unbalanced 
parentheses. We will use the same notation for intervals of integers: the context will make it clear, 
whether [a, b] or [a, 6] fl Z is understood. Given a set A of space or space-time and a real number c, 
we write 

cA = {cv: V € A}. 

If the reader wonders why lists of assertions arc sometimes denoted by (a) , (b) , . . . and sometimes 
by (1), (2), . . . , here is the convention I have tried to keep to. If I list properties that all hold or are 
required (conjmiction) then the items are labeled with (a),(b), . . . while if the list is a list of several 
possible cases (disjunction) then the items are labeled with (1), (2), .... 

Maxima and minima will sometimes be denoted by V and A. We will write log for log2. 

2.1. Deterministic cellular automata. Let us give here the most frequently used definition of 

cellular automata. Later, we will use a certain generalization. The set C of sites has the form Z,„ 
for finite or infinite m. This will mean that in the finite case, we take periodic boundary conditions. 
In a space-time vector {x,t), we will always write the space coordinate first. For a space-time set 
E, we will denote its space- and time projections by 

(2.1) TTsE,7rtE 

respectively. We will have a finite set S of states, the potential states of each site. A (space) 
configuration is a function 

ax) 

for X G C. Here, ^{x) is the state of site x. 

The time of work of our cellular automata will be the interval [0, oo— ]. Our space-time is given 

by 

V = C X [0,00-]. 
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A space-time configuration is a space-time function rj(x, t) which for each t defines a space configu- 
ration. If in a space-time configuration rj we have t]{x,v) = S2 and r]{x,t) = si ^ for all t < 
sufficiently close to v then we can say that there was a switch from state si to state S2 at time v. 
For ordinary discrete-time cellular automata, we allow only space-time configurations in which all 

switching times are natural numbers 0, 1, 2, The time is considered a switching time. If there 

is an e such that rj{c, t) is constant iov a — e <t < a then this constant value will be denoted by 

(2.2) ??(c,a-). 

The subconfiguration ^{D') of a configuration ^ defined on D D D' is the restriction of ^ to D'. 
Sometimes, we write 

V{V) 

for the sub-configuration over the space-time set V. 
A deterministic cellular automaton 

CA(Tr,C). 

is determined by a transition function Tr : S'^ ^ S and the set C of sites. We will omit C from the 
notation when it is obvious from the context. A space-time configuration is a trajectory of this 
automaton if 

7j{x,t) = Tr{r]{x-l,t-l),f]{x,t-l),7]{x + l,t-l)) 
holds for all x, t with t > 0. For a space-time configuration rj let us write 

(2.3) TF(77, X, t) = Trir^ix - 1, t), r,{x, t),Tj{x + 1, t)). 

Given a configuration ^ over the space C and a transition function, there is a unique trajectory r] 
with the given transition function and the initial configuration ri{-, 0) = ^. 

2.2. Fields of a local state. The space-time configuration of a deterministic cellular automaton 
can be viewed as a "computation". Moreover, every imaginable computation can be performed by 
an appropriately chosen cellular automaton function. This is not the place to explain the meaning 
of this statement if it is not clear to the reader. But it becomes maybe clearer if we point out 
that a better known model of computation, the Turing machine, can be considered a special cellular 
automaton. 

Let us deal, from now on, only with cellular automata in which the set S of local states consists 
of binary strings of some fixed length called the capacity of the sites. Thus, if the automaton 
has 16 possible states then its states can be considered binary strings of length 4. If ||S|| > 1 then 
the information represented by the state can be broken up naturally into parts. It will greatly help 
reasoning about a transition rule if it assigns different functions to some of these parts; a typical 
"computation" would indeed do so. Subsets of the set {0, . . . , ||§|| — 1} will be called fields. Some 
of these subsets will have special names. Let 

An = {0,... ,||S||-1}. 

If s = {s{i) : i e All)) is a bit string and F = {ii, . . . , ik} is a field with ij < ij+i then we will write 

s.F= (s(ii),... ,s{ik)) 
for the bit string that is called field F of the state. 

Example 2.1. If the capacity is 12 we could subdivide the interval [0, 11] into subintervals of lengths 
2,2,1,1,2,4 respectively and call these fields the input, output, mail coming from left, mail coming 
from right, memory and workspace. We can denote these as Input, Output, Mailj [j = —1, 1), Work 
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and Memory. If s is a state then s.Input denotes the first two bits of s, s.Maih means the sixth bit 
of s, etc. <C> 

Remark 2.2. Treating these fields differently means we may impose some useful restrictions on the 
transition function. We might require the following, calling Mail-i the "right-directed mail field": 

The information in Maii_i moves always to the right. More precisely, in a trajectory 77, 
the only part of the state r]{x, t) that depends on the state 'q{x — B,t — T) of the left 
neighbor is the right- directed mail field 'ri{x,t).AIail^i. This field, on the other hand, 
depends only on the right- directed mail field of the left neighbor and the workspace field 
r]{x, t — T).WoTk. The memory depends only on the workspace. 
Confining ourselves to computations that are structurc;d in a similar way make reasoning about them 
in the presence of faults much easier. Indeed, in such a scheme, the effects of a fault can propagate 
only through the mail fields and can affect the memory field only if the workspace field's state allows 
it. 

Fields are generally either disjoint or contained in each other. When we join e.g. the input fields 
of the different sites we can speak about the input track, like a track of some magnetic tape. 

2.2.1. Bandwidth and separability. Here we are going to introduce some of the fields used later in 
the construction. It is possible to skip this part without loss of understanding and to refer to it later 
as necessary. However, it may show to computer scientists how the communication model of cellular 
automata can be made more realistic by introducing a new parameter w and some restrictions on 
transition functions. The restrictions do not limit information-processing capability but 

• limit the amount of information exchanged in each transition; 

• limit the amount of change made in a local state in each transition; 

• restrict the fields that depend on the neighbors immediately; 

A transition function with such structure can be simulated in such a way that only a small part of 
the memory is used for information processing, most is used for storage. For a fixed bandwidth w, 
we will use the following disjoint fields, all with size w: 

(2.4) Inbuf =[0,w -1], Outbuf, Pointer. 
Let 

Buf = Inbuf U Outbuf, Memory = All \ Inbuf. 

Note that the fields denoted by these names are not disjoint. For a transition function Tr : S"^ — > S, 
for an integer [log ||§||]<u'<||§||,we say that Tr is separable with bandwidth w if there is a function 

(2.5) Tr^'") : {0,1}^'" ^ {0,1}^™ 

determining Tr in the following way. For states r_i,ro,ri, let a = ro. Pointer, (a binary number), 
then with 

(2.6) p = Tr('")(r_i.Buf , ro.{Buf U[a,a + w -1]), n.Buf), 
we define 

(2.7) Tr{r-i,ro,ri).{Buf U Pointer) = p.[0,3w - 1]. 

The value Tr(r_i , tq, ri) can differ from ro only in Buf U Pointer and in field [n,n + w — 1] where 
n = p.lAw, 5w — 1] (interpreted as an integer in binary notation) and then 

Tr(r_i, ro, ri).[n, n + w — 1] = p.[3w, Aw — I]. 
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It is required that only Inbuf depends on the neighbors directly: 

(2.8) Tr (r_i , ro , ri ) .Memory = Tr{Vac, tq , Vac). Memory, 
where Vac is a certain distinguished state. Let 

,rr/ X |1 if V. Memory = Tr(Vac,u,Vac).Memory, 
legal {u,v) = < 

I otherwise. 

Thus, sites whose transition function is separable with bandwidth w communicate with each other 
via their field Buf of size 2w. The transition function also depends on ro.[a,a + w — 1] where 
a = ro. Pointer. It updates ro.{Buf U Pointer) and ro.[n + w — 1] where n is also determined by 

Remark 2.3. These definitions turn each site into a small "random access machine". <0 

When designing a cellular automaton with small bandwidth for theoretical purposes, the guiding 
principle is to put almost everything into Buf, so that we do not have to worry about communication 
with neighbors. All the fields having to do with computation and administration will be small. The 
only exception is the bulk of the information that the cell is supposed to store. But even here, the 
redundant part of the information used for error-correcting purposes will be small and can be in 
Buf. 

2.3. Probabilistic cellular automata. A random space-time configuration is a pair (/i,??) where 
/Lt is a probability measure over some measurable space {fl,A) together with a measurable function 
ri{x,t,u}) which is a space-time configuration for all w e O. We will generally omit w from the 
arguments of rj. When we omit the mention of ji we will use Prob to denote it. If it does not lead to 
confusion, for some property of the form {r] G R}, the quantity : r]{-, -,10) Si?} will be written 
as usual, as 

We will denote the expected value of / with respect to by 

where we will omit /i when it is clear from the context. A function f{ri) with values 0, 1 (i.e. an 
indicator function) and measurable in A will be called an event function over A. Let W be any 
subset of space-time that is the union of some rectangles. Then 

A{W) 

denotes the a-algebra generated by events of the form 

{ ri{x, t) = s for ti < t < t2} 

for s e §, {x,ti) e W. Let 

At=A{C X [0,t]). 

A probabilistic cellular automaton 

PCA{P, C) 

is characterized by saying which random space-time configurations are considered trajectories. Now 
a trajectory is not a single space-time configuration (sample path) but a distribution over space-time 
configurations that satisfies the following condition. The condition depends on a transition matrix 
P(s, (r_i, ro, ri)). For an arbitrary space-time configuration 77, and space-time point {x, t), let 

(2.9) P(?7, s, X, t) = P(s, {r]{x - 1, t), r]{x, t),r,{x + 1, t))). 
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We will omit the parameter rj when it is clear from the context. The condition says that the random 
space-time configuration is a trajectory if and only if the following holds. Let xq, ■ ■ ■ ,Xn+i be 
given with = + 1. Let us fix an arbitrary space-time configuration C and an arbitrary event 
7i 9 C of positive probability in At-T- Then we require 

{n n+1 ^ 

fl t) = ax^,t)} \nnf] Mx^, i - 1) = C{x^,t - f )} I 
i=l 1=0 J 

n 

= l[P{V,C{x^,t),X^,t~l). 

i=l 

A probabilistic cellular automaton is noisy if P(s,r) > for all s,r. Bandwidth can be defined for 
transition probabilities just as for transition functions. 

Example 2.4. As a simple example, consider a deterministic cellular automaton with a "random 
number generator". Let the local state be a record with two fields, Det and Rand where Rand 
consists of a single bit. In a trajectory (/i, rj), the field rj.Det{x, t-\-l) is computed by a deterministic 
transition function from 'q{x — 1, t), rj{x^ t), ri{x + l,t), while 'q.Rand{x, t + I) is obtained by "coin- 
tossing" . <) 

A trajectory of a probabilistic cellular automaton is a discrete-time Markov process. If the set of 
sites consists of a single site then P(s,r) is the transition probability matrix of this so-called finite 
Markov chain. The Markov chain is finite as long as the number of sites is finite. 

2.4. Continuous-time probabilistic cellular automata. For later reference, let us define here 
(1-dimensional) probabilistic cellular automata in which the sites make a random decision "in each 
moment" on whether to make a transition to another state or not. These will be called continuous- 
time interacting particle systems. A systematic theory of such systems and an overview of many 
results available in 1985 can be found in [ [20|. Here, we show two elementary constructions, the 
second one of which is similar to the one in |f6t. The system is defined by a matrix R(s, r) > of 
transition rates in which all "diagonal" elements R(ro, (r_i,ro,ri)) are 0. 

2.4.1. Discrete-time approximation. Consider a generalization PCA(P, _B, (5, C) of probabilistic cel- 
lular automata in which the sites are at positions iB for some fixed B called the body size and integers 
I, and the switching times are at 0, 5, 2(5, 3(5, .. . for some small positive 5. Let Ms = PCA(P, I, 5) 
with P(s,r) = (5R(s,r) when s ^ tq and 1 — (5^^,_^j,^ R(s',r) otherwise. (This definition is sound 
when 5 is small enough to make the last expression nonnegative.) With any fixed initial configura- 
tion ?7(-, 0), the trajectories rjs of Ms will converge (weakly) to a certain random process rj which is 
the continuous-time probabilistic cellular automaton with these rates, and which we will denote 

CCA(R,C). 

The process defined this way is a Markov process, i.e. if we fix the past before some time io then 
the conditional distribution of the process after io will only depend on the configuration at time 
to- For a proof of the fact that rjs converges weakly to rj, see |2^, and the works quoted there. 
For a more general definition allowing simultaneous change in a finite number of sites, see [ pO[ . A 
continuous-time interacting particle system is noisy if R(s,r) > for all s ^ r^. 



2.5. Perturbation. 
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2.5.1. Discrete time. Intuitively, a deterministic cellular automaton is fault-tolerant if even after it 
is "perturbed" into a probabilistic cellular automaton, its trajectories can keep the most important 
properties of a trajectory of the original deterministic cellular automaton. We will say that a random 
space-time configuration {fi,ri) is a trajectory of the e- perturbation 



of the transition function Tr if the following holds. For all a;o, • . . Sn+i, t with Xi^i = Xi + I and 
events H in At-i with fJ,{H) > 0, for all < «i < ■ • • < ik < n, 



Note that CA^{Tr, C) is not a probabilistic cellular automaton. If we have any probabilistic cellular 
automaton PCA{P, C) such that P(s, r, C) < 1 — e whenever s = Tr(r) then the trajectories of this 
are trajectories of CA^{Tr, C); however, these do not exhaust the possibilities. We may think of the 
trajectory of a perturbation as a process created by an "adversary" who is trying to defeat whatever 
conclusions we want to make about the trajectory, and is only restricted by the inequalities that the 
distribution of the trajectory must satisfy. 

2.5.2. Continuous time. By the e -perturbation of a continuous-time interacting particle system with 
transition rates given by R(s, r), we understand the following: in the above construction of a process, 
perturb the matrix elements R(s, r) by some arbitrary amounts smaller than e. Note that this is a 
more modest kind of perturbation since we perturb the parameters of the process once for all and 
the perturbed process is again a continuous-time interacting particle system. 

2.5.3. Remembering a few bits. Suppose that the bit string that is a local state has some field F (it 
can e.g. be the first two bits of the state). We will say that Tr remembers field F if there is an e > 
such that for each string s S {0, l}'^' there is a configuration such that for an infinite C, for all 
trajectories (/i, 77) of the e-perturbation CAg(Tr, C) with rj{-, 0) — £,s, for all x, t we have 



We define similarly the notions of remembering a field for a probabilistic transition matrix P and a 
probabilistic transition rate matrix R. 
One of the main theorems in [|o| says: 

Theorem 2.5 (1-dim, non-ergodicity, discrete time). 

There is a one- dimensional transition function that remembers a field. 

One of the new results is the following 

Theorem 2.6 (1-dim, non-ergodicity, continuous time). 

There is a one- dimensional transition-rate matrix that remembers a field. 



CA,(Tr,C) 



f]Mx^^,t)^Trir,,x,^,t-l)} \ nn f]{7^{x„t ^ I) = s,} \ < 




fi{7j{x,t).F = s}> 2/3. 
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3. Codes 

3.1. Colonies. For the moment, let us concentrate on the task of remembering a single bit in a field 



called Main_bit of a cellular automaton. We mentioned in Subsection 1.2 that in one dimension, 
even this simple task will require the construction and maintenance of some non-local organization, 
since this is the only way a large island can be eliminated. 

This organization will be based on the concept of colonies. Let a; be a site and Q a positive 
integer. The set of Q sites x + i for is [0, Q — 1] will be called the Q-colony with base x, and site 
X + i will be said to have address i in this colony. Let us be given a configuration ^ of a cellular 
automaton M with state set §. The fact that ^ is "organized into colonies" will mean that one can 
break up the set of all sites into non-overlapping colonies of size Q, using the information in the 
configuration ^ in a translation-invariant way. This will be achieved with the help of an address field 
Addr which we will always have when we speak about colonies. The value £^{x).Addr is a binary 
string which can be interpreted as an integer in [0, Q — 1]. Generally, we will assume that the Addr 
field is large enough (its size is at least logQ). Then we could say that a certain Q-colony C is a 
"real" colony of ^ if for each element y of C with address i we have £^{y).Addr = i. In order to allow, 
temporarily, smaller address fields, let us actually just say that a Q-colony with base a; is a "real" 
colony of the configuration ^ if its base is its only element having Addr = 0. 

Cellular automata working with colonies will not change the value of the address field unless it 
seems to require correction. In the absence of faults, if such a cellular automaton is started with 
a configuration grouped into colonics then the sites can always use the Addr field to identify their 
colleagues within their colony. 

Grouping into colonies seems to help preserve the Main_bit field since each colony has this infor- 
mation in Q-fold redundancy. The transition function may somehow involve the colony members in a 
coordinated periodic activity, repeated after a period of U steps for some integer U, of restoring this 
information from the degradation caused by faults (e.g. with the help of some majority operation). 

Let us call U steps of work of a colony a work period. The best we can expect from a transition 
function of the kind described above is that unless too many faults happen during some colony work 
period the Main_bit field of most sites in the colony will always be the original one. Rather simple 
such transition functions can indeed be written. But they do not accomplish qualitatively much 
more than a local majority vote for the Main_bit field among three neighbors. 

Suppose that a group of failures changes the original content of the Main-bit field in some colony, 
in so many sites that internal correction is no more possible. The information is not entirely lost 
since most probably, neighbor colonies still have it. But correcting the information in a whole colony 
with the help of other colonies requires organization reaching wider than a single colony. To arrange 
this broader activity also in the form of a cellular automaton we use the notion of simulation with 
error-correction . 

Let us denote by Mi the fault-tolerant cellular automaton to be built. In this automaton, a 
colony C with base x will be involved in two kinds of activity during each of its work periods. 

Simulation: Manipulating the collective information of the colony in a way that can be in- 
terpreted as the simulation of a single state transition of site x of some cellular automaton 

M2. 

Error-correction: Using the collective information (the state of x in M2) to correct each site 
within the colony as necessary. 

Of course, even the sites of the simulated automaton M2 will not be immune to errors. They must 
also be grouped into colonies simulating an automaton M3, etc.; the organization must be a hierarchy 
of simulations. 
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Reliable computation itself can be considered a kind of simulation of a deterministic cellular 
automaton by a probabilistic one. 

3.2. Block codes. 

3.2.1. Codes on strings. The notion of simulation relies on the notion of a code, since the way the 
simulation works is that the simulated space-time configuration can be decoded from the simulating 
space-time configuration. A code, ip between two sets R,S is, in general, a pair where 
: i? — > 5 is the encoding function and if* : S ^ Ris the decoding function and the relation 

<P*{^*{r)) =r 

holds. A simple example would be when R = {0,1}, S = R^, (p*{r) = (r,r,r) while ip*{{r,s,t)) is 
the majority of r,s,t. 

This example can be generalized to a case when §i,§2 are finite state sets, R = S2, S = 
where the positive integer Q is called the block size. Such a code is called a block code. Strings 
of the form ip^ (r) are called codewords. The elements of a codeword s = (p* (r) are numbered as 
s(0), . . . , s{Q — 1). The following block code can be considered the paradigmatic example of codes. 

Exam-pie 3.1. Suppose that Si = §2 = {0,1}^^ is the state sot of both cellular automata M\ and 
M2. Let us introduce the fields s.Addr and s.Info of a state r ~ (sq, ■ ■ ■ , sn) in §1. The Addr 
field consists of the first 5 bits sq, . . . , S4, while the Info field is the last bit sn. The other bits 
do not belong to any named field. Let Q = 31. Thus, we will use codewords of size 31, formed of 
the symbols (local states) of Mi, to encode local states of M2. The encoding funcion p^ assigns a 
codeword (p*{r) = (s(0), . . . , s(30)) of elements of Si to each element r of S2. Let r = (ro, ... , rn). 
We will set s{i).Info = rj for i = 0, . . . , 11. The 5 bits in s{i).Addr will denote the number i in 
binary notation. This did not determine all bits of the symbols s(0), . . . , s(30) in the codeword. In 
particular, the bits belonging to neither the Addr nor the Info field are not determined, and the 
values of the Info field for the symbols s{i) with i ^ [0, 11] are not determined. To determine </J*(r) 
completely, we could set these bits to 0. 

The decoding function is simpler. Given a word s = (s(0), . . . , s(30)) we first check whether it is 
a "normal" codeword, i.e. it has s(0).Addr = and s{i).Addr 7^ for i ^ 0. If yes then , r = ip*{s) 
is defined by = s{i).Info for i G [0,11], and the word is considered "accepted". Otherwise, 

(s) = • • • and the word is considered "rejected" . 

Informally, the symbols of the codeword use their first 5 bits to mark their address within the 
codeword. The last bit is used to remember their part of the information about the encoded symbol. 



For two strings u, v, we will denote by 
(3.1) uUv 
their concatenation. 

Example 3.2. This trivial example will not be really used as a code but rather as a notational 
convenience. For every symbol set §1, blocksize Q and §2 = there is a special block code lq 
called aggregation defined by 

LQ.MO), ...,s{Q- 1))) = s(0) U • • • U s(Q - 1), 



and Lq defined accordingly. Thus, tg* is essentially the identity: it just aggregates Q symbols of 
Si into one symbol of S2. We use concatenation here since we identify all symbols with binary 
strings. O 
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—I I I I I I I I l_ 



Figure 1 . Three neighbor colonics with their tracks 



The codes (p between se ts R , S used in our simulations will have a feature similar to the acceptance 



and rejection of Example 3.1. The set R will always have a special symbol called Vac, the vacant 
symbol. An element s G S* will be called accepted by the decoding if tf*{s) ^ Vac, otherwise it is 
called rejected. 

3.3. Generalized cellular automata (abstract media). A block code Lp could be used to define 
a code on configurations between cellular automata Mi and M2 ■ Suppose that a configuration ^ of 
M2 is given. Then we could define the configuration ^, = <p*(^)of Mi by setting for each cell a; of ^ 
and < i < Q, 

£,*{Qx + i) = ip^,{£,{x))(i). 

The decoding function would be defined correspondingly. This definition of decoding is, however, 
unsatisfactory for our purposes. Suppose that is obtained by encoding a configuration ^ via Lp^ 
as before, and C is obtained by shifting f*: C^{x) = — 1). Then the decoding of C, will return all 
vacant values since now the strings {C,{Qx), • • • , C{Qx + Q — 1)) are not "real" colonies. However, 
it will be essential for error correction that whenever parts of a configuration form a colony, even 
a shifted one, the decoding should notice it. With our current definition of cellular automata, the 
decoding function could not be changed to do this. Indeed, if C,* is the configuration decoded from 
^ then C*(0) corresponds to the value decoded from ((^(O),--- , C(Q — and C*(l) to the value 
decoded from {C,{Q)r-' XC^Q ~ !))■ There is no site to correspond to the value decoded from 

(C(l),--- ,C(Q)); 

Our solution is to generalize the notion of cellular automata. Let us give at once the most general 
definition which then we will specialize later in varying degrees. The general notion is an abstract 
medium 

AMed(§, C, ConEgs, Evols, Trajs). 

Here, Configs is the set of functions ^ : C ^ S that are configurations and Evols is the set of 
functions 77 : C x [0, 00—] — > § that are space-time configurations of the abstract medium. Further, 
Trajs is the set of random space-time configurations (/i, 77) that are trajectories. In all cases that we 
will consider, the set Trajs will be given in a uniform way, as a function Traj(C) of C. The sets §, C, 
Configs and Evols are essentially superfiuous since the set Trajs defines them implicitly — therefore 
we may omit them from the notation, so that eventually we may just write 

AMed{Trajs). 

Given media Mi, M2 for the same §, C, we will write 



Ml C M2 
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if Trajsi C Trajs2- Let 

Ml nM2 

be the medium whose trajeetory set is Trajs^ PI Tiajs^- 
Let us now consider some special cases. 

3.3.1. Cellular abstract media. All abstract media in this paper will be cellular, the sets Configs 
and Evols will be defined in the way given here. The set § of local states will always include a 
distinguished state called the vacant state Vac. If in a configuration ^ we have ^(.t) 7^ Vac then 
we will say that there is a cell at site x in ^. We will have a positive number B called the body 
size. In ordinary cellular automata, B = 1. For a site x, interval [x,x + B—] will be called the 
body of a possible cell with base x. A function ^ : C — »■ § is a configuration if the cells in it have 
non-intersecting bodies. 

Remark 3.3. Since not each site will be occupied by a cell, it is not even important to restrict the 
set of sites to integers; but we will do so for convenience. ■()> 

A function 77 : C x [0, 00—] ^ § is a space-time configuration if 

(a) T]{-,t) is a space configuration for each t; 

(b) ri{x,t) is a right-continuous function of t: 

(c) Each finite time interval contains only finitely many switching times for each site x; 

A dwell period is a tuple {x,s,t\,t2) such that x is a site, s is a nonvacant state, and Q <t\ < t2 
are times. The rectangle [.x, x + B—\ x [ti, t2—\ is the space-tim,e body of the dwell period. It is easy 
to see that the dwell periods in a space-time configuration have disjoint bodies. This completes the 
definition of the sets Configs and Evols in cellular abstract media, the only kind of media used in 
this paper. Therefore from now on, we may write 

AMed{C,Trajs, B). 

We may omit any of the arguments if it is not needed for the context. 

We will speak of a lattice configuration if all cells are at sites of the form iB for integers i. We 
can also talk about lattice space-time configurations: these have space-time bodies of the form 

[iB, {i + 1)B-] X \jT, {j + 1)T-] 

for integers i, j. 

A special kind of generalized cellular automaton is a straightforward redefinition of the original 
notion of cellular automaton, with two new but inessential parameters: a deterministic cellular 
automaton 

CA{Tr,B,T,C) 

is determined by B,T > and a transition function Tr : E>^ ^ §. We may omit some obvious 
arguments from this notation. A lattice space-time configuration rj with parameters B,T is a 
trajectory of this automaton if 

r]{x, t) = Tr{r]{x -B,t- T),r]{x, t - T), r]{x + B,t- T)) 

holds for all x, t with t>T. For a space-time configuration r/ let us write 

(3.2) IV(J7, X, t, B) = Tr{r]{x - B, t),r]{x, t), r]{x + B, t)). 

We will omit the argument B when it is obvious from the context. Probabilistic cellular automata 
and perturbations are generalized correspondingly as 

PCA{P,B,T,C), CA,(Tr,B,T,C). 
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From now on, whenever we talk about a deterministic, probabilistic or perturbed cellular automaton 
we understand one also having parameters B, T. 

We will have two kinds of abstract medium that are more general than these cellular automata. 
We have a constant-period medium if in all its trajectories, all dwell period lengths are multiples of 
some constant T. Otherwise, we have a variable-period medium. 

3.3.2. Block codes between cellular automata. In a cellular abstract medium with body size B, a 
colony of size Q is defined as a set of cells x-\-iB ior i G [0, Q — 1] . Thus, the union of the cell bodies 
of body size i? in a colony of size Q occupies some interval [x,x-\- QB—] . A block code will be called 
overlap-free if for every string (s(0), . . . s{n — 1)), and aWi < n~ Q, if both (s(0), . . . , s{Q — 1)) and 
(s(i + 1), . . . ,s{i-\-Q — 1)) are accepted then i > Q. In other words, a co de is overlap-free if two 
accepted words cannot overlap in a nontrivial way. The code in Example |3.l| is overlap-free. All 
block-codes considered from now on will be overlap-free. Overlap-free codes are used, among others, 
in HI. 

3.3.3. Codes on configurations. A block code of block size Q can be used to define a code on 
configurations between generalized abstract media Mi and M2. Suppose that a configuration ^ of 
M2, which is an AMed{QB), is given. Then we define the configuration = f*{0 of Mi, which is 
an AMed{B), by setting for each cell x of ^ and < z < Q, 

^^{x + iB) = ip.,{^{x)){i). 

Suppose that a configuration ^ of Mi is given. We define the configuration ^* — (p*{^) of M2 as 
follows: for site x, we set £,*{x) = (p*{s) where 

(3.3) s = (^(x), e(x + B),... ,ax + iQ- l)B)). 

If ^ is a configuration with ^ = (^*(C) then, due to the overlap-free nature of the code, the value ^*{x) 
is nonvacant only at positions x where C(x) is nonvacant. If ^ is not the code of any configuration 
then it may happen that in the decoded configuration ip* (^) , the cells will not be exactly at a distance 
QB apart. The overlap-free nature of the code garantees that the distance of cells in (p*{£,) is at 
least QB even in this case. 

3.4. Block simulations. Suppose that Mi and M2 are deterministic cellular automata where M,; = 
CA{Tri, Bi, Ti), and (p is a block code with 

Bi = B, B2 = QB. 



The decoding function may be as simple as in Example 3T: there is an Info track and once the 
colony is accepted the decoding function depends only on this part of the information in it. 
For each space-time configuration jj of Mi, we can define rj* — y:>*{ri) of M2 by setting 

(3.4) ry*(.,i)-^*(rK-,i)). 

We will say that the code (p is a simulation if for each configuration ^ of M2, for the trajectory (/i, rj) 
of Ml, such that r](-,0,uj) = (p*{£,) for almost all to, the random space-time configuration {n,r]*) is 
a trajectory of M2. (We do not have to change fj, here since the uj in 77* (x, t, lo) is still coming from 
the same space as the one in r]{x,t,oj).) 

We can view ip^, as an encoding of the initial configuration of M2 into that of Mi . A space-time 
configuration rj of Mi will be viewed to have a "good" initial configuration ?7(-,0) if the latter is 
<P>*{£,) for some configuration of M2. Our requirements say that from every trajectory of Mi with 
good initial configurations, the simulation-decoding results in a trajectory of M2. 
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Let US show one particular way in which the code can be a simulation. For this, the function 
Tri must behave in a certain way which we describe here. Assume that 

Ti =T, = UT 

for some positive integer U called the work period size. Each cell of Mi will go through a period 
consisting of U steps in such a way that the Info field will be changed only in the last step of this 
period. The initial configuration ?y(-,0) = (pt:{£,) is chosen in such a way that each cell is at the 
beginning of its work period. By the nature of the code, in the initial configuration, cells of Mi are 
grouped into colonies. 

Once started from such an initial configuration, during each work period, each colony, in coop- 
eration with its two neighbor colonies, computes the new configuration. With the block code in 



Example 3.1, this may happen as follows. Let us denote by r_,ro,r+ the value in the first 12 bits 
of the Info track in the left neighbor colony, in the colony itself and in the right neighbor colony 
respectively. First, r_ and are shipped into the middle colony. Then, the middle colony computes 
s — Tr2(r_i, ro, ri) where Tr2 is the transition function or M2 and stores it on a memory track. 
(It may help understanding how this happens if we think of the possibilities of using some mail, 
memory and workspace tracks.) Then, in the last step, s will be copied onto the Info track. 
Such a simulation is called a block simulation. 

Example 3.4. Let us give a trivial example of a block simulation which will be applied, however, 
later in the paper. Given a one-dimensional transition function Tr(x,y, z) with state space S, we 
can define for all positive integers Q an aggregated transition function Tr''^ {u,v,w) as follows. The 
state space of of Tr'^ is S*^. Let Vj — {rj{0), . . . ,rj{Q — 1)) for j = —1,0, 1 be three elements of S*^. 
Concatenate these three strings to get a string of length 3Q and apply the transition function Tr 
to each group of three consecutive symbols to obtain a string of length SQ — 2 (the end symbols do 
not have both neighbors). Repeat this Q times to get a string of Q symbols of §: this is the value 
of TrO (r_i,ro,ri). 

For Ml = CA{S,Tr,B,T) and M2 = CA{§^ ,Tr^ ,QB,QT), the aggregation code tg defined 
in Example |3.2| will be a block simulation of M2 by Mi with a work period consisting oi U ~ Q 
steps. If along with the transition function Tr, there were some fields F,G, . . . C All also defined 
then we define, say, the field F in the aggregated cellular automaton as [jfSf^^{F + i||S||). Thus, if 
r = r(0) U • • • U r{Q — 1) is a state of the aggregated cellular automaton then r.F — r{0).F U r(l).F U 
•••Ur(g-1).F. ^ 

A transition function Tr is universal if for every other transition function Tr' there are Q, U and 
a block code (p such that <y9 is a block simulation of CA{Tr' , Q, U) by CA{Tr, 1, 1). 

Theorem 3.5 (Universal cellular automata). There is a universal transition function. 

Sketch of proof: This theorem is proved somewhat analogously to the theorem on the existence of 
universal Turing machines. If the universal transition function is Tr then for simulating another 
transition function Tr', the encoding demarcates colonies of appropriate size with Addr = 0, and 
writes a string Table that is the code of the transition table of Tr' onto a special track called Prog in 
each of these colonies. The computation is just a table-look-up: the triple (r_, ro, r+) mentioned in 
the above example must be looked up in the transition table. The transition function governing this 
activity does not depend on the particular content of the Prog track, and is therefore independent 
of Tr'. For references to the first proofs of universality (in a technically different but similar sense), 
see i, □ 



Note that a universal cellular automaton cannot use codes similar to Example 3.1. Indeed, in 



that example, the capacity of the cells of Mi is at least the binary logarithm of the colony size, since 
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each colony cell contained its own address within the colony. But if Afi is universal then the various 
simulations in which it participates will have arbitrarily large colony sizes. 

The size Q of the simulating colony will generally be very large also since the latter contains 
the whole table of the simulated transition function. There are many special cellular automata M2, 
however, whose transition function can be described by a small computer program and computed 
in relatively little space and time (linear in the size ||§2||)- The universal transition function will 
simulate these with correspondingly small Q and U . We will only deal with such automata. 

3.5. Single-fault-tolerant block simulation. Here we outline a cellular automaton Mi that 
block-simulates a cellular automaton M2 correctly as long as at most a single error occurs in a 
colony work period of size U . The outline is very informal: it is only intended to give some frame- 
work to refer to later: in particular, we add a few more fields to the fields of local states introduced 
earlier. For simplicity, these fields are not defined here in a way to make the cellular automaton 



separable in the sense defined in 2.2.1. They could be made so with a few adjustments but we want 
to keep the introduction simple. 

The automaton Mi is not universal, i.e. the automaton M2 cannot be chosen arbitrarily. Among 
others, this is due to the fact that the address field of a cell of Mi will hold its address within its 
colony. But we will see later that universality is not needed in this context. 

The cells of Mi will have, besides the Addr field, also a field Age. If no errors occur then in the 
i-th step of the colony work period, each cell will have the number i in the field Age. There are also 
fields called Maii, Jnfo, Work, Hold, Prog. 

The Info field holds the state of the represented cell of M2 in three copies. The Hold field will 
hold parts of the final result before it will be, in the last step of the work period, copied into Info. 
The role of the other fields is clear. 

The program will be described from the point of view of a certain colony C. Here is an informal 
description of the activities taking place in the first third of the work period. 

1. From the three thirds of the Info field, by majority vote, a single string is computed. Let us 
call it the input string. This computation, as all others, takes place in the workspace field 
Work; the Info field is not affected. The result is also stored in the workspace. 

2. The input strings computed in the two neighbor colonies are shipped into C and stored in the 
workspace separately from each other and the original input string. 

3. The workspace field behaves as a universal automaton, and from the three input strings and 
the Prog field, computes the string that would be obtained by the transition function of M2 
from them. This string will be copied to the first third of the Hold track. 

In the second part of the work period, the same activities will be performed, except that the 
result will be stored in the second part of the Hold track. Similarly with the third part of the work 
period. In a final step, the Hold field is copied into the Info field. 

The computation is coordinated with the help of the Addr and Age fields. It is therefore important 
that these are correct. Fortunately, if a single fault changes such a field of a cell then the cell can 
easily restore it using the Addr and Age fields of its neighbors. 

It is not hard to see that with such a program (transition function), if the colony started with 
"perfect" information then a single fault will not corrupt more than a third of the colony at the end 
of the work period. On the other hand, if two thirds of the colony was correct at the beginning of 
the colony work period and there is no fault during the colony work period then the result will be 
"perfect". 



3.6. General simulations. The main justification of the general notion of abstract media is that 
it allows a very general definition of simulations: a simulation of abstract medium M2 by abstract 



RELIABLE CELLULAR AUTOMATA 



21 



medium Mi is given by a pair 

where is a mapping of the set of space-time configurations of Mi into those of M2 (the decoding), 
and (f^, is a mapping of the set of configurations of M2 to the set of configurations of Mi (the encoding 
for initiaUzation) . Let us denote 

We require, for each trajectory rj for which the initial configuration has the encoded form ri(-, 0) = 
f *{£,), that rj* is a trajectory of M2 with ?7*(-,0) — ^. 

A simulation will be called local, if there is a finite space-time rectangle V* = I x [—u, 0] such that 
^*{r]){'w, t) depends only on 'q{{w, t) + V*). Together with the shift-invariance property, the locality 
property implies that a simulation is determined by a function defined on the set of configurations 



over V* . All simulations will be local unless stated otherwise. Corollary 7.3 gives an example of 
non-local simulation. 

If w = then the configuration r]*{-,t) depends only on the configuration T]{-,t). In this case, the 
simulation could be called "memoryless" . For a memoryless simulation, the simulation property is 



identical to the one we gave at the beginning of Subsection 3.4. Our eventual simulations will not 
be memoryless but will be at least non- anticipating: we will have u > 0, i.e. the decoding looks back 
on the space-time configuration during [t — u,t], but still does not look ahead. In particular, the 
value of ?7*(-,0) depends only on 7y(-,0) and therefore the simulation always defines also a decoding 
function ip* on space-configurations. From now on, this decoding function will be considered part 
of the definition of the simulation, i.e. we will write 

Suppose that a sequence Mi, M2, ... of abstract media is given along with simulations $1, $2, ■ ■ • 
such that is a simulation of Mk+i by M^. Such a system will be called an amplifier. Amplifiers 
are like renormalization groups in statistical physics. Of course, we have not seen any nontrivial 
example of simulation other than between deterministic cellular automata, so the idea of an amplifier 
seems far-fetched at this moment. 

3.6.1. Simulation between perturbations. Our goal is to find nontrivial simulations between cellular 
automata Mi and M2, especially when these are not deterministic. If Afi,M2 are probabilistic 
cellular automata then the simulation property would mean that whenever we have a trajectory 
(/U, r?) of Ml the random space-time configuration rf decoded from rj would be a trajectory of M2. 
There are hardly any nontrivial examples of this sort since in order to be a trajectory of M2, the 
conditional probabilities of (p* (77) must satisfy certain equations defined by P2 , while the conditional 
probabilitiees of r] satisfy equations defined by Pi. 

There is more chance of success in the case when Mi and M2 are perturbations of some deter- 
ministic cellular automata since in this case, only some inequalities must be satisfied. The goal of 
improving reliability could be this. For some universal transition function Tr2, and at least two dif- 
ferent initial configurations {i = 0, 1), find Tri, Q, U, c with Bi =5,^2 = BQ, Ti = T, T2 = TU 
and a block simulation $1 such that for all e > 0, if £1 = e, £2 = ce^ and Mk is the perturbation 

CA,,{TYk,Bk,n,Z) 

then $1 is a simulation of M2 by Mi. The meaning of this is that even if we have to cope with the 
fault probability e the simulation will compute Tr2 with a much smaller fault probability ce^ . The 
hope is not unreasonable since in Subsection |3.5| , we outlined a single-fault-tolerant block simulation 
while the probability of several faults happening during one work period is only of the order of 
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{QUe)'^. However, it turns out that the only simply stated property of a perturbation that survives 
noisy simulation is a certain initial stability property (see below). 

3.6.2. Error- correction. Even if the above goal can be achieved, the reason for the existence of the 
simulated more reliable abstract medium is to have feedback from it to the simulating one. The 
nature of this feedback will be defined in the notion of error-correction to whose definition now we 
proceed. Let us call the set 

(3.5) Eo = {0, !,#,*} 

the standard alphabet. Symbol ^ will be used to delimit binary strings, and * will scirve as a "don't- 
care" symbol. Each field F of a cell state such that the field size is even, can be considered not only 
a binary string but a string of (half as many) symbols in the standard alphabet. If r, s are strings 
in (So)" then 

r ^ s 

will mean that s{i) = r{i) for all < i < n such that r{i) ^ *. Thus, a don't-care symbol r{i) 
imposes no restriction on s in this relation. There will be two uses of the don't-care symbol. 

• The more important use will come in defining the code used for error-correction in a way that 
it requires the correction of only those parts of the information in which correction is desirable. 

• The less important use is in defining the notion of "monotonic output" : namely, output that 
contains more and more information as time proceeds. This is convenient e.g. for continuous- 
time cellular automata, where it is difficult to say in advance when the computation should 
end. 

For codes y * , ■^^ , we will write 

V'* ^ V* 

if for all ,s we have ?/'*(s) ^ ip*{s). 

Let us define the notion of error-correction. Let $ = ((p*, <^*, $*) be a simulation whose encoding 
is a block code with block size Q, between abstract cellular media Mj {i = 1,2). Let Tj > 
(i = 1,2) be some parameters, and i^** ^ a block code of blocksize Q. We say that $ has the 
e- error- correction property with respect to (/5**,Ti,T2 if the following holds for every configuration 
^ of M2 and every trajectory (/x, ry) of Mi with r]{-,0) = <f*{^). 

Let rj* = $*(?7) and let xi,X2 be sites where xi has address a in the Q-colony with base X2, let 
to be some time. Let £ be the event that r]*{x2, •) is nonvacant during [to — T2/3,to] and let £' be 
the event that for each t in [to — Ti/3, to] there is a t' in [to —T2/3, t] with 

'P**iri*ix2,t'))ia) 'n{xi,t). 

Then Prob{ £ n -.£'}< e. 

Informally, this means that for all xi,X2,a in the given relation, the state rj{xi,t) is with large 
probability what we expect by encoding some r]*{x2,t') via and taking the a-th symbol of the 
codeword. Error-correction is only required for a code ^ ip^ since (p* determines the value of 
many fields as a matter of initialization only: these fields need not keep their values constant during 
the computation, and therefore t^** will assign don't-care symbols to them. The code (p** will thus 
generally be obtained by a simple modification of . 

Example 3.6. Let 



Bi = l, B2=Q, Ti = l, T2 = U 
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to - Ti/3 



t - Ti/3 
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X2 + QB 



time 



to - T2/3 



Figure 2. Error correction 



for some U > Q. Assume that for k = 1,2, our media are cellular generalized media with body 
sizes -Bfe and state spaces Sk- Assume further that M2 has at least a field F2 with IF2I < Q/3 and 
Ml has at least the fields Fi,Addr,Age, with |Fi| = 2. For a state s € S2, let the string 

s' = <^*(s) eSf 

be defined as follows. Take the binary string S.F2, repeat it 3 times, pad it with *'s to a string of 
size Q of the standard alphabet: let this be a string (/(O), . . . , f{Q — 1)). Now for each address b, 
let 

s'{b).Fi = f{b), s'{b).Addr = b, s'{b).Age = 0, 

and let all other fields of s'{b) be filled with *'s. The definition of s" = (/?*,(s) starts as the definition 
of s' with the only difference that set s"{b).Age = *. Thus, the code </?*(s) encodes a redundant 
version of S.F2 onto the Fi track of the block s' and initializes the Addr and Age tracks to the 
values they would have at the beginning of a work period. The code 99*, (,s) loaves all tracks other 
than Fi and Addr undetermined, since it will have to be compared with the state of the colony also 
at times different from the beginning of the work period. 
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Suppose that an amplifier {Mk,^k)k>i is given along with the sequences (pk**,Tk, e'^. We will 
call this structure an error- correcting amplifier if for each fc, the simulation $fc has the e'^- error 
correction property with respect to (fk**, Tfe, Tk+i- 

3.7. Remembering a bit: proof from an amplifier assumption. The following lemma will be 
proved later in the paper. 

Lemma 3.7 (Initially Stable Amplifier). We can construct the following objects, for k = 1,2, . . . . 

(a) Media over state space E>k, simulations $fc ~ {(fi^i,,t^^*,^^*) and sequences tpk**, 7fc 
forming an e'^- error correcting amplifier with X^fc^fc l/Q- 

(b) (Initial stability) Parameters Sk,Bi with X^fc^fc ^ 1/6; transition function Tri and two config- 
urations ^0,^1 such that, defining — ^u^^ — 'p''*{£,u) /o*" u = 0,1, we have 

Ml = CA,,{Tri,Bi,Ti,Z). 

Further, for each k,u, for each trajectory rj of Mk with ri{-,0) = for all t < Tk, for each site 
X, we have 

Prob{ ■q{x, t) ^ ■q{x, 0) } < ek- 

(c) Parameters Qk such that the codes ipk* are block codes with block size Qk- 

(d) (Broadcast) Fields F^ for the state spaces Sk such that for each k, for each address a G [0, Qfc — 1] 
and state s G Sfe+i, for each u £ {0, 1} and site x we have 

(3.6) C^(x).F'= = u, 

(3.7) s.F''+^ <^k*M{a).FK 



Equation (3.6) says for the configurations (obtained by decoding from the initial configuration 



^„) in each cell of medium Alk, field Fk has value u. Equation (3.7) says the following. Assume for 
symbol s G Sfc+i we have s.Fk+i = u ^ *. Then the encoding function 1^9*, encodes s into a colony 
of Qk symbols ro, . . . , tq^^i such that for each a, we have Sa-Fk = u. Thus, the Fk+i field of s gets 



"broadcast" into the Fk field of each symbol of the code of s. This way, even if property (|3.6|) were 



assumed only for a fixed level k, property would (3.7) imply it for all i < fc 



Let us use this lemma to prove the first theorem. 



Proof of Theorem |^. j| . Let us use the amplifier defined in the above lemma. Let 77^ be a trajectory 
of the medium Mi with initial configuration Let 77'"' be defined by the recursion rf^^ = ^kiv'')- 
Let {xi, ti) be a space-time point in which we want to check ri{xi, ti).F^ = u. There is a sequence of 
points xi, X2, ■ ■ ■ such that Xk+i is a cell of ri''''^^{-, 0) containing Xk in its body with some address bk- 
There is a first n with ti < T„/3. Let Tk be the event that j^^{xk,t).F^ = u for t in [ti — Tfc/3,ii]. 
The theorem follows from the bounds on Ek and e'l and from 

n-1 

(3.8) Prob{ ^{Ti n • • • n ^„) } < e„ + ^ e'^'. 

k=l 

To prove this inequality, use 

n-1 

^{Ti r\■■■c^Tn) = ^Tni^\J {^Tk n Tk+i). 

k=l 

By the construction, r7"(a;„, 0).F" = u. Since the duration of [0,ti] is less than T„ we have 
Prob{ ?7"(a;n, ti) 7^ 77"(a;„,0)} < e„ by the initial stability property, proving Prob{ -iJF„ } < e„. 
The error-correction property and the broadcast property imply Prob{ J-k+i fl -iJ-k } < £fc. D 
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Figure 3. Fields of a cell simulated by a colony 



4. Hierarchy 

4.1. Hierarchical codes. The present section may seem a long distraction from the main course 
of exposition but many readers of this paper may have difficulty imagining an infinite hierarchical 
structure built into a configuration of a cellular automaton. Even if we see the possibility of such 
structures it is important to understand the great amount of flexibility that exists while building it. 
Formally, a hiearchy will be defined as a "composite code" . Though no decoding will be mentioned in 
this subsection, it is still assumed that to all codes (p* mentioned, there belongs a decoding function 
if* with ip*{ipt,{x)) = X. 

4.1.1. Composite codes. Let us discuss the hierarchical structure arising in an amplifier. If (fjip are 
two codes then ipo^p is defined by {^potp)^,{£^) = and (</3 0'i/))*(^) — ip* {ip* . It is assumed 
that ^ and ( are here configurations of the appropriate cellular automata, i.e. the cell body sizes are 
in the corresponding relation. The code (p o tjj is called the composition of ip and ip. 

For example, let Mi, M2, M3 have cell body sizes 1, 31, 31^ respectively. Let us use the code ip from 
Example |3.l| . The code p^ — p o p maps each cell c of M3 with body size 31^ into a "supercolony" 
of 31 • 31 cells of body size 1 in Mi. Suppose that C, = is a configuration obtained by encoding 

from a lattice configuration of body size 31^ in M-^, where the bases of the cells are at positions 
—480 + 31^1. (We chose -480 only since 480 = (31^ — l)/2 but we could have chosen any other 
number.) Then Q can be broken up into colonies of size 31 starting at any of those bases. Cell 55 
of Ml belongs to the colony with base 47 = —480 + 17 • 31 and has address 8 in it. Therefore the 
address field of C(55) contains a binary representation of 8. The last bit of this cell encodes the 
8-th bit the of cell (with base) 47 of M2 represented by this colony. If we read together all 12 bits 
represented by the Info fields of the first 12 cells in this colony we get a state C*(47) (we count from 
0). The cells with base —15 + ilj for j G Z with states C*(^15 + 31j) obtained this way are also 
broken up into colonies. In them, the first 5 bits of each state form the address and the last bits 
of the first 12 cells, when put together, give back the state of the cell represented by this colony. 
Notice that these 12 bits were really drawn from 31^ cells of Mi. Even the address bits in C*(47) 
come from different cells of the colony with base 47. Therefore the cell with state C(55) does not 
contain information allowing us to conclude that it is cell 55. It only "knows" that it is the 8-th cell 
within its own colony (with base 47) but does not know that its colony has address 17 within its 
supercolony (with base —15 • 31) since it has at most one bit of that address. 

4.1.2. Infinite composition. A code can form composition an arbitrary number of times with itself or 
other codes. In this way, a hierarchical, i.e. highly nonhomogenous, structure can be defined using 
cells that have only a small number of states. A hierarchical code is given by a sequence 



(4.1) 



(Sfc, Qk, fk*)k>l 
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where Sk is an alphabet, Qk is a positive integer and (ph^, : S>k+i —> S>^'° is an encoding function. 
Since Sfc and Qk are implicitly defined by ipk* we can refer to the code as just {(fik)- 

We will need a composition ipi^ 0Lp2*0' ■ ■ of the codes in a hierarchical code since the the definition 
of the initial configuration for Mi in the amplifier depends on all codes ifi^ . What is the meaning of 
this? We will want to compose the codes "backwards" , i.e. in such a way that from a configuration 

of Ml with cell body size 1, we can decode the configuration — of M2 with cell body 

size B2 = Qi, configuration — (p2{^'^)^ of with body size B3 — Q1Q2, etc. Such constructions 
are not unknown, they were used e.g. to define "Morse sequences" with applications in group theory 
as well in the theory of quasicrystals (jl^, [24[|). 

Let us call a sequence ai, 02, . . . with < < Qk non-degenerate for Qi, (52, ... if there are 
infinitely many k with Ok > and infinitely many k with ak < Qk ~ 1. The pair of sequences 

(4.2) (Qfc,afc)r=i 

with non-degenerate will be called a block frame of our hierarchical codes. All our hierarchical 
codes will depend on some fixed block frame, {{Qk,o.k)), but this dependence will generally not be 
shown in the notation. 

Remarks 4.1. 

1. The construction below does not need the generality of an arbitrary non-degenerate sequence: 
we could have = 1 throughout. We feel, however, that keeping ak general makes the 
construction actually more transparent. 

2. It is easy to extend the construction to degenerate sequences. If e.g. Ok = for all but a finite 
number of k then the process creates a configuration infinite in right direction, and a similar 
construction must be added to attach to it a configuration infinite in the left direction. 

For a block frame {{Qk, Ofc)), a finite or infinite sequence (si, ai), {s2, 02), . . . will be called fitted 
to the hierarchical code {(pk*) if 

</'fc*(sfc+i)(afe) = Sfe 
holds for all k. For a finite or infinite space size N, let 

Bi = 1, 

(4.3) Bk = Qi---Qk-itor k>l, 

(4.4) K = K{N) = sup A: -I- 1, 

Bk<N 

Ok — —aiBi — ... — ak-iBk-i, 

(4.5) Ck{x) ^ Ok + xBk, 
The following properties are immediate: 

oi = Ci(0) = 0, 

(4.6) Ofc = Ofc+i -f flfcBfe, 

Oeok + [0,Bk-l]. 

Proposition 4.2. Let us be given a fitted sequence {sk,ak)k>i- Then there are configurations ^'^ of 
Mk over Z such that for all k > 1 we have 

$''(Ofc) = Sfc. 
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The infinite code we are interested in is Note that in this construction, Sk is the state of the 
site Ok in configuration ^'^ whose body contains the site 0. This site has address in a colony with 
base Ok+i in ^'^+^. 

Proof. Let 

(4.7) 4' 

be the configuration of Mk which has state Sk at site Ok and arbitrary states at all other sites Ck{x), 
with the following restriction in case of a finite space size N. Let ^ Vac only for k < K = K{N) 
and {ok + z) ^ Vac only if 

(4.8) < zBk < N. 
Let 

(4.9) a = ^i*(^(i+i)*(---V(fe-i)*(^fe)---)) 
for fc > i > 1. We have 

(4.10) ik+iiok) = ipkMtliok+i))icik) = ^kiok) 

where the first equation conies by definition, the second one by fittedness. The encoding conserves 

this relation, so the partial configuration Ck+ii^k+i + [0, Bk+i — 1]) is an extension of Cki^k + [0, Bk — 
1]). Therefore the limit = lim^^^ exists for each i. Since (a^) is non-degenerate the limit extends 
over the whole set of integer sites. □ 

Though above is obtained by an infinite process of encoding, no infinite process of decoding is 
needed to yield a single configuration from it: at the fc-th stage of the decoding, we get a configuration 

with body size Bk- 

4.1.3. Controlling, identification. The need for some freedom in constructing infinite fitted sequences 
leads to the following definitions. For alphabet § and field F let 

S.F = {w.F -.w eS}. 

Then, of course, ||§.-F|| = \F\. Let D = {do,... jC^idi-i} C [0,(5 — 1] be a set of addresses with 
di < di+i. For a string s, let 

s{D).F 

be the string of values {s{do).F, . . . , s{d\o\_i).F) so that 

s.F = si[0,Q-l]).F. 
Field F controls an address a in code (f* via function 7 : Si.F ^ Si if 

(a) For all r G §i.F there is an s with (^,(,s)(a).F = r; in other words, (/?*(s)(a).F runs through all 
possible values for this field as s varies. 

(b) For all s we have 93,(s)(a) = 7((^*(s)(o).F); in other words, the field <^*(s)(a).F determines all 
the other fields of (/?*(s)(a). 

Prom now on, in the present subsection, whenever we denote a field by F'^ and a code by (pk* we 
will implicitly assume that F^ controls address ak in tpk* unless we say otherwise. (The index k in 
F*^ is not an exponent.) 

Suppose that fields F^ , F^ are defined for cellular automata Mi and M2 between which the code 
(fi with blocksize Q is given. Suppose that set D satisfies \D\ — |F^|/|F^|. We say that in (p*, field F^ 
over D is identified with F^ if in any codeword w = 93,(5), the string w{do).F^ U - ■ ■ Uw{d^£i\_i).F^ is 
identical to s.F^. Conversely, thus w{di).F^ = s.F^{[i\F^\, (i + 1)|F^| - 1]). The identification of F^ 
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Figure 4. Assume that in the code of this figure, Addr, Age and Worksp are 
constant and Prog^ has always in its position 4. Then address 4 is controlled by 
Info^, and Info^ is identified with Info^ over the addresses 4-17. 



with over D implies that if a simulation has error-correction in over D then the information 
restored there is s.F^. 

Example 4.3. Consider the simulation outlined in Subsection |3.5| , and let us call the encoding ip^,. 
Let F^ be the Info field of Afi : we denote it by Info^ . Assume further that the simulated medium 
M2 is of a similar type, so it has an Info^ field. Assume for simplicity 

6 I Q, \Info^\ = 2, \Info'^\^Q/3. 

The Info^ field of cells in interval [0, Q/S — 1] of a simulating colony represents the whole state of a 
simulated cell of A/2. The Info field on [Q/3, Q — 1] is only used for redundancy. Only a segment of 
the Info field of [0, Q/3 — 1], say the one on [0, (3/6 — 1] is used to represent info of the simulated 
cell. The rest is used for encoding the other fields. Hence Info on [0,(3/6 — 1] is identified with 
Jiifo in our code 95*. 

Let s' — (^*(s)(l) be the state of the cell with address 1 of a colony of Mi which is the result 
of encoding state s of Af2. Let s'.Info^ be the third and fourth bits of s.Info^, s' .Addr = 1, and 
s'.F = for all fields different from these two. Then Info^ controls address 1 in the code 

If for each address a, the field F^ over {a} is identified with F^ then we say that F^ is broadcast 
to F^ (since this means that the code copies the value of s.F^ into the F^ field of each cell of tp^{s)). 
Let us be given 

(4.11) ((§fe,Qfe,(^fe,,F^7fe,afe) :fc> 1) 

where 1 < < Qk — 2, such that in code ipk*, 

(a) F'' over {ak} is identified with F''^^; 

(b) F'' controls address ak for ipk* via 7^; 

Such a system of fields F'^ will be called a primitive shared field for the hierarchical code (ipk*)- If 
also each code ipk*, broadcasts f''^^ into F*^ then we will say that the fields F*^ form a broadcast 
field. Note that the field still controls only a single address a^. The Main_bit field mentioned in 



Subsection 3.1 would be an example. 



Proposition 4.4. For any hierarchical code with primitive shared field given as in ( 4.11 ) above, fi 



for 



all possible values ui G Si.F^ the infinite sequence (sk, ak)k>i with Sk = 7fc(i*i) is fitted. 



The proof is immediate from the definitions. 
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Let US denote the configurations £,^,£,1 that belong to this fitted infinite sequence according to 
Proposition (and its proof) by 

(4.12) , 

AAA. Coding an infinite sequence. Let us show now how a doubly infinite sequence of symbols can 
be encoded into an infinite starting configuration. 
Let us be given 

(4.13) * = (§fc,Qfc,<y2fc,,F'',gfc,7fc,afe)fc>i 

where 2 < < Qk and the sequence ak is non-degenerate, such that in code ^pt*, 

(a) F'' over [0, qk - 1] is identified with F'=+^; 

(b) F'' controls address ak for ipk* via ] 

Such a system will be called a shared field for the fields F*^, and the hierarchical code {(pk*), and the 
fields F^ will be denoted as 

Ffc(^). 

The identification property implies that for all < a < g/j, we have 

(4.14) (^fc,(s)(a).F'= = s.F^+i([a|F'=|, (a + 1)|F'=| - 1]). 

Example 4.5. Let us show some examples of codes (pk in which F'" over [0, qk — 1] is identified with 

A code ?/; = (V'*,V'*) with i/)* : — > 5* will be called d- error- correcting with blocksize Q if for 
all u,v, if u differs from ip^i^v) in at most d symbols then '4'*(u) = v. Assume that both R and S 
are of the form {0, 1}" (for different n). A popular kind of error-correcting code are codes ip such 
that 4'* is a linear mapping when the binary strings in S and R'^ are considered vectors over the 
field {0, 1}. These codes are called linear codes. It is sufficient to consider linear codes ip which 
have the property that for all s, the first \S\ bits of the codeword ■(/'*(s) are identical to s: they are 
called the information bits. (If a linear code is not such, it can always be rearranged to have this 
property.) In this case, the remaining bits of the codeword are called error-check bits, and they are 
linear functions of s. Applying such linear codes to our case, for s S S^+i, let 

w = ipk*is)- 

Then we will have 

Uo<a<Q.w(a).F'^' = ^Pk*is-F''+^) 

for a linear code ipk whose information bits are in Uo<ca<qkW (a).F'= and error-check bits are in 
Ll5fc<a<(5fc w(a).F'^. For c? = 1, if we are not trying to minimize the amount of redundancy in 



the error correction then we may want to use the tripling method outlined in Subsection 3.5 and 



Example 4.3, which sets qk — Qk/S. In this case, the error-check bits simply repeat the original bits 



twice more. <0> 



Example 4.6. In a digression that can be skipped at first reading, let us define the more sophisticated 
linear code we will be using in later construction (a generalization of the so-called Reed-Solomon 
code, see 0). 

Let our codewords (information symbols and check symbols together) be binary strings of length 
Nl for some I, N. Binary strings of length I will be interpreted as elements of the Galois field 
G'F(2') and thus, each binary string c of length Nl will be treated as a vector (c(0), . . . , c{N — 1)) 
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Information bits of F'^'^^ 



_l I I I I I I I I I I I I I I I I I I I L_ 



Error-check bits of F'=+^ 



_l I I I I I L_ 



Figure 5. Error-correcting code in a shared field 



over GF{2^). (Note that the word "field" is used in two different senses in the present paper.) Let 
us fix N distinct nonzero elements of GF(2') and let t < N/2 be an integer. The codewords are 
those vectors c that satisfy the equation 

Af-l 

(4.15) ^a^c(i).F'= = 0(j = l,...,2t) 

j=0 

where the addition, multiplication and taking to power j are performed in the field GF(2'). These 
arc 2f linear equations. If we fix the first N — 2t elements of the vector in any way, (these are the 
information symbols) the remaining 2t elements (the error check symbols) can be chosen in a way to 
satisfy the equations, by solving a set of 2t linear equations. This set of equations is always solvable, 
since its determinant is a Vandcrmonde determinant. 

Below, we will show a procedure for correcting any v <t nonzero errors. This shows that for the 
correction of error in any < t symbols, only 2t error-check symbols are needed. 

li E = (eo, . . . ,eAr_i) is the sequence of errors then the word that will be observed is C + -E. 
Only Bj^ are nonzero for r = 1, . . . , Let Yr = ei^,Xr = ai^. Then we define the syndrome Sj for 
i = l,... ,2tby 

(4.16) S, = ^(c. + ei)c4 = J2 = E ^^^r 

i i r 

which can clearly be computed from the codeword: it is the amount by which the codeword violates 
the j-th error check equation. We will show, using the last expression, that Yr and Xr can be 
determined using Sj . We first define the auxiliary polynomial 

V 

A(a;) = ]J(1 - xXr) = Y^A.x' 

r s=0 

whose roots are X~^. Let us show how to find the coefficients for s > 0. We have, for any 
r = 1, . . . ,1/, and any j = 1,. . . ,2t — u: 

= y^x^+''A(x-i) = ^ A«y^x^+''-^ 

Hence, summing for r, 

(4.17) = Y,^s{Y.^rXl+''-') =J2^sSj+.-s (j = 1, . . . ,2i - z.) 

s=0 r s=0 
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hence using Aq — 1, ^Y^^^-^^sSj+v-s ~ ^Sj+y. This is a system of hnear equations for whose 
coefficients are the syndroms, known to us, and whose matrix is nonsingular. Indeed, M^, = 
ABA^ where B is the diagonal matrix (YrXr) and A is the Vandermonde matrix Aj ^ = X^~^. 

A decoding algorithm now works as follows. For v = 1,2,... ,t, see if M is nonsingular, then 
compute A(x) and find its roots by simple trial-and-error, computing A(ai~^) for all i. Then, find 



Yr by solving the set of equations (4.16) and see if the resulting corrected vector C satisfies (4.15). 
If yes, stop. 

(There is also a faster way for determining A(a:), via the Euclidean algorithm, see 
To make the code completely constructive we must find an effective representation of the field 
operations of GF{2}). This finite field can be efficiently represented as the set of remainders with 
respect to an irreducible polynomial of degree I over GF(2), so what is needed is a way to generate 
large irreducible polynomials. Now, it follows from the theory of fields that 

x"^'^" + x^' + 1 

is irreducible over GF{2) for any s. So, the scheme works for all / of the form 2-3'*. (} 

In a hierarchical code, with a shared field, there is a function X(y) with the property that site y 
of the original information will map to site X{y) in the code. To define this function, let 

be defined like Bk,Ok, Ck{y) but using qk in place of Qfc- For all fc, every integer y can be represented 
uniquely in the form 

fc 

(4.48) y = Y.^y[ - ai)B[ 

1=1 

where < y,' < qi for i < k. Since (uk) is non-degerate for (gfc), this is true even with fc = oo, in 
which case the above sum is finite. Let 

fc 

X{y, i; fc) = X{y, i; fc, *) = ^ {y'„^ ~ a„^)B,r„ 

m—i 

(4.49) X{y,i)^X{y,i;K{N)), 

X{y;k)^X{y,l;k), 

X{y)=X{y,l;K{N)), 

Define the same notation for X' with i?^ instead of Bk- Notice that X{0,i) — X'{0,i) — 0, 
X'{y, 4) — y. Clearly, the sites of form Oi + X{y, i; fc) for all possible y will form a lattice of distance 
Bi. If i < fc then the definitions give 

(4.20) o, + X{y, i; fc) = o,+i + X{y, i + l;k)+ y[B,. 



Using the notation lq introduced in 3.2.4, let us define the aggregated configurations 

(4.24) Q'' = iB'^M 

of body size B'^. over Z. Then, of course, 

(4.22) g^+^ = .,,,(/). 

Let 

Visible{k, N) ^ {y : < y ~ ok < NB'^/Bk }■ 

0<zBk<N 
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Then Xijj) is defined whenever y G Visible{k, N), i.e. the symbols g{y) can be recovered after 
encoding whenever y is in this interval. 

Proposition 4.7. For a hierarchical code with a shared field as given in ( 4.13| ), for an arbitrary 
configuration g = in (Si.F^)^, there are configurations ^'^ over Z such that for all k > I , y G 
we have 

(4.24) ^k*{e^')=e, 

(4.25) eiXiy)).F' = g\y). 
More generally, we have 

(4.26) f (o, + X{y, i)).F = g\o', + X'{y, z)) 

for 1 <i < k. If the space is Zn for a finite N then all these configurations with the above properties 
exist for all k with < N, and (4.25) holds whenever y G Visible{K{N), N) . 

Proof. The proof is mechanical verification: we reproduce it here only to help the reader check the 
formalism. 

1. Let us construct 

For infinite space size, let 

(4.27) C.'(C.(y))=7fe(e'=(C^(y))). 

For finite space size N, define the above only for < N and y in [0, lN/Bk\ — 1], where Ck{y) 
on the left-hand side is taken (mod N). In all other site s x, let S,^{x) = Vac. Let £,1 be d efined 
again by (^j|). We define as in the proof of Proposition It is sufficient to show ( 4.10| ) again 
to prove that the limits in question exist. By definition, 

^t+iM - ipk*i£Uli^k+i)){ak) = (^fc*(7fe+i(/+'(o;i+i)))(a0- 

By the controlling property, its F'' field r completely determines the last expression via 7^. By 
the identification property ( 4.14 ) and the aggregation property ( 4.22 ), 

r = /+i(o',+i)(KSl., (a,. + 1)B', ~ 1]) = e\dk^^ + aM ^ qHo'^). 

By definition, Aio^) — jkiQ'' {o'^)) which proves the statement. 

2. Let us show ( [4.26D . 

Proof. We use induction on i, from k down to 1. The case i = k says 

eiCkiy'k-ak)).F''^g''iC'M-ak)) 

which follows from the definition of Assume that the stat emen t was proved for numbers > i: 
we prove it for i. By the definitions of X{y,i) and ^* and by (4.20) we have 



(4.28) e{o^ + X{y, ij) = e{o^+l + X{y, i + 1) + y[B,) 



By induction. 



¥'»(f+'(om+^(y,» + l)))(2/:) 



r+i(o,+i + X{y, I + l)).F'+i = g^+\o',^^ + X'{y,i + 1)) = g^+\z) 

where z = o[j^i + X'{y, i + 1) can also be written in the form C[j^i{x) for some x. Now we have, 
by the identification property (4.14) and the aggregation property (4.22) 

(4.29) Ai.F = e\z + y[B[) = g\o[^, + X'{y, z + 1) + y[B'^ = g\o\ + X'{y, t)) 

where the third equality is implied by the definition of X'{y,i). 

□ 
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H'' Error-check bits of F''+^ 



F'' Information bits of F''+'^ 



_I I I I I I I I I I I I I I I I I I I I I I I I I I I I L_ 



Figure 6. Error-correcting code in a shared field, with at least one information bit 
per cell 



In analogy with ( 4.12 ), we will denote this code as follows: 

(4.30) t 

r is the limit code with approximations r(-; k) and the function X{y) is the site map of the system 
^I^. Note that for finite space size N, we have r{g) = T{g; k) for the largest k with Bk < N. 
The proof also shows that X{y; k) plays the role of the site map for the approximation: 

(4.31) ek{ny;k)).F' ^ g\y), 

The growth of the quotients X{y)/y is a measure of how the infinite code stretches its input, 
i.e. of the "space redundancy". (Strictly speaking, the redundancy is defined as X{y)/y — 1.) The 
value X{y; k)/y for each approximating code is limited since it stretches blocks of size B'f. into blocks 
of size Bk- If a code has qk = Qk as in the example below then X{y) = y. 



Example 4.8. Let us show a variant of Example 4.5 with qk = Qk- The details can be skipped at 
first reading. 

Field F'"' is a binary string of length Ik ~ hQi ■ ■ ■ Qk-i- Let s £ Sk+i, 

w = (pk*{s), 

v{a) =w{a).F'' (a = 1, . . . ,Qfc-l). 

The information symbols of the code are v{a) for < a < Qk — 1. Let the positive integers m^, 
be such that 

rukhk = Ik, rUkUk < Qk- 

A narrower track w.H'^ contains the error-check bits for the same information, where = hk (see 
Figure]^). For each < i < Uk, the concatenation of strings w(a).Jj'^ with a G [iruk, {i + l)mk — 1] 
will be denoted 

v{i + Qk)- 

These are the error-check symbols. With the code of Example we have — 2t and Qk+rik = N. 
The redundancy of the code is l/nik- <D 

Proposition 4.9. In the example above, l/wfc < oo. 
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Proof. For the capacity of the ceUs in Mk we have 

||S/c|| ^lk + hk+rk = lk{i + i/mk) + Tk 

where rj. is the number of bits not belonging to or . The state of a cell of Mk+i must be 
stored in the fields of the cells of the colony representing it, excluding the error-correcting bits in 
H''. Hence 

Ik+i + hk+i + rk+i < Qk{lk + rk), 
hk+i < QkTk - r-fc+i, 
(4.32) l/wfe+i < Tk/lk - Tk+i/lk+i-, 

oo 

1/mfe < ri/h. 



k=2 



□ 



4.1.5. Infinitely many fields. The above construction will be used mainly to encode the input 
into the configuration and to find the sites where the output can be retrieved. The information 
is kept on each level k in field F*"'. In the case when besides information storage also computation 
will be going on, several configurations may have to be encoded, representing e.g. the output of the 
same computation at different times (see Subsection |6.5[ ). 

Here we will set up the framework for coding infinitely many sequences, each to it own track. Since 
any infinite sequence can be broken up into infinitely many infinite subsequences this elaboration is 
routine, but it is worth fixing some notation. Readers interested only in information conservation 
can skip this construction. 

Let us be given, for k=l,2, ... , < i < k, 

(4.33) * = (§fc,Qfc,^fc*,(Fj^)^ti,ft,Pfc,7fe)fe>i 

such that 2 < qk + Pk l£ Qk, and in code ipk* the following properties hold: 

(a) For each fc, field ljj<fe ^'j controls via 7fe; 

(b) F^ is identified with F*'^^ over [0, qk + Pk ~ 1] if j < A: and over [0, qk - 1] if j = k; 

(c) over qk + [0,Pk- 1] is identified with F^}; 

Such a system will be called a standard system of shared fields and we will write 

F^j ^F'^i^H). 
For simplicity, we only consider infinite space. 



Proposition 4.10. For a standard system of shared fields ^' as in (4.33), there are functions 
Xj(y,'i') = Xj{y) with Xj{Q) = such that for any infinite sequence dj of integers and any in- 
finite sequence of configurations 

Qj e [Si. Flf for j>l 
there are configurations ^'^ such that for all k, j > 1, y ^ Z we have 

e{Xj{y) + d^B^).Fl^g^{y). 

The proof of this proposition is routine computation, so we omit it here. 

The sequence dj gives additional freedom of shifting the origins independently for each j . 
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In analogy with (4.12) we define 

C' = r((e,);*) = r((,,)), 
efc = ^((e,);fc,*) = ^((^>,);fc). 

and call r() the limit code and Xj{y) the site map of this many-field hierarchical code. We may 
want to use the same configm'ation gi for each gj: e.g. if we start the computation on infinitely 
many levels simultaneously, from the same input. 



4.2. The active level. In the (preliminary) definition of error-correction in Subsection |3.6| , we used 
a code 1^9** ^ (p^,. Let us discuss the typical structure of the codes (pk** belonging to a hierarchical 
code. 

If ipk** coincides with (pk* over the field F'^, i.e. for all fc, s, a we have ipk**{s)ia)-F'' — ipk*{s){a).F'^ 
then we will say that (F ) are broadcast fields resp. shared fields with respect to tpk**, too, whenever 
they are such in the code (pk*- Next, we give a slight refinement of this notion for the case of reliable 
computation. 

The definitions given here are only needed if the cellular automata are also meant to be used for 
computation: they are not strictly needed if the goal is only information storage. However, it will 
be convenient to use them even in that case. 

For a shared field, essentially the same space can be used to store the track F'^ as the one used 
for F*^"^^ since the information on the two tracks is the same. Therefore these fields cannot be used 
by the different levels independently of each other for computation. The mechanism enforcing this 
is the error-correction property which restricts the value of F'^ by the value of f'^'^^ (with which 
it is identified). Thus, changing the information in track F'^ we change it on all levels below. We 
should therefore know which level is the "active" one, the one being changed directly rather than as 
a consequence of the code constraints. This level should not be disturbed by error-correction coming 
from higher levels. 

The active level will be marked in the following way. Let us be given, for k ~ 1,2,..., new fields 
G'' with \G''\ = 2. The four possible values of G'' wiU be identified with -1, 0, 1, *. We will say that 
(F^G'=) define a sequence of guarded shared fields if the following properties hold: 

(a) For aU < a < Q, 



-1 ifs.G''+^<0, 
* otherwise; 



s.F'^+1 ifs.G''+^<0, 
* • • • * otherwise. 



Pk**{s){a).G' = 

(b) 

(4.36) pk**{s){[0,qk-l]).F'' = 

Notice that since pk** ^ Pk*, this also imposes some conditions on {p>k*)- 

Typically, for a certain k we wih have C{-)-G' > for all i > k, £,^{').G^ = 0, and f (O-G' < 
for i < k. This distinguished k will show the "active" level, on which the field F'^ can be changed: 
G'^ shows whether we are on, below or above the active level. The fact ^'^^^{■).G'^~^^ > will imply 
that the level (fc -I- 1) does not restrict us from changing ^'^(x).F'^ (e.g. by computation). The levels 
below the active one are the ones subject to error-correction coming from F*^ since the properties 
imply that the F* for all i < k behaves like a shared field in code (iy9i,,) just as it does in code {ipi*)- 
The guard field G'""''^ is broadcast whenever it is negative. 
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With a guarded shared field, if the active level is k then the definition of r(ui; k) — in (4.12) 
will always imply the additional property 

(4.37) ek{^).G'' = 

for all X. Subsequent codings by V'(fc-i)*7 6tc. imply Cfc-G' = —1 for i < k. The active level oo 
will mean ^^{x).G'' = —1 for all k. All propositions in the present section about the existence of 
encoded configurations, can be enhanced to include a guard field, with an arbitrarily chosen active 
level (possibly oo). 

Remarks 4.11. 1. All the tracks F'^ for different k "can use the same space" since G'^^'^ has 
the information showing the way F*^ depends on F'^^^ . However, each track G^ must be 
represented in new space since G'^ is not identified with G'^'^^. 
2. The only place in the present paper where the possibility of changing the G'' field is exploited 



is 5.2.1 





4.3. Major difficulties. The idea of a simulation between two perturbed cellular automata is, 
unfortunately, flawed in the original form: the mapping defined in the naive way is not a simulation in 
the strict sense we need. The problem is that a group of failures can destroy not only the information 
but also the organization into colonies in the area where it occurs. This kind of event cannot therefore 
be mapped by the simulation into a transient fault unless destroyed colonies "magically recover" . The 
recovery is not trivial since "destruction" can also mean replacement with something else that looks 
locally as legitimate as healthy neighbor colonies but is incompatible with them. One is reminded 
of the biological phenomena of viruses and cancer. Rather than give up hope let us examine the 
different kinds of disruption that the faults can cause in a block simulation by a perturbed cellular 
automaton Mi. 



Let us take as our model the informally described automaton of Subsection 3.5. The information 
in the current state of a colony can be divided into the following parts: 

• "information" : an example is the content of the Info track. 

• "structure" : the Addr and Age tracks. 

• "program" : the Prog track. 

More informally, the "structure" does not represent any data for the decoding but is needed for co- 
ordinating cooperation of the colony members. The "program" determines which transition function 
will be simulated. The "information" determines the state of the simulated cell: it is the "stuff" 
that the colony processes. 

Disruptions are of the following kinds (or a combination of these): 

(1) Local change in the "information"; 

(2) Locally recognizable change in the "structure" ; 

(3) Program change; 

(4) Locally unrecognizable change in "structure" ; 

A locally recognizable structure change would be a change in the address field. A locally unrec- 
ognizable change would be to erase two neighbor colonies based, say, at BQ and 2BQ and to put a 
new colony in the middle of the gap of size 2BQ obtained this way, at position 1.5BQ. Cells within 
both the new colony and the remaining old colonies will be locally consistent with their neighbors; 
on the boundary, the cells have no way of deciding whether they belong to a new (and wrong) colony 
or an old (and correct) one. 
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M2 >- Univ 

1* 

Ml Univ 

Figure 7. An amplifier in which the simulations ip''* are "hard- wired" . Universal- 
ity is not lost since each medium Mk also simulates some universal cellular automa- 
ton. 

The only kind of disruption whose correction can be attempted along the lines of traditional error- 
correcting codes and repetition is the first one: a way of its correction was indicated in Subsection ^.5| . 
The three other kinds are new and we will deal with them in different ways. 

To fight locally recognizable changes in the structure, we will use the method of destruction and 
rebuilding. Cells that find themselves in structural confiict with their neighbors will become vacant. 
Vacant cells will eventually be restored if this can be done in a way structurally consistent with their 
neighbors. 

To fight program changes, our solution will be that the simulation will not use any "program" 
or, in other words, "hard-wire" the program into the transition function of each cell. We will not 
lose universality this way: the automata will still be universal, i.e. capable of simulating every other 
automaton by appropriate block simulation; but this particular simulation will differ from the others 
in that the transition function will perform it without looking at any program. 

To fight locally unrecognizable changes, we will "legalize" all the structures brought about this 
way. Consider the example where a single colony sits in a gap of size 2BQ. The decoding function is 
defined even for this configuration. In the decoded configuration, the cell based at site is followed 
by a cell at site 1.5BQ which is followed by cells at sites 3BQ,4BQ, etc. Earlier, we did not 
have any use for these illegal configurations. We must legalize them now. Indeed, since they can 
be eliminated only with their own active participation, we must have rules (trajectory conditions) 
applying to them. This is the real reason for the introduction of generalized cellular automata. 

The generalized cellular automaton actually needed will be called a robust medium. The general- 
ization of the notion of the medium does not weaken the original theorem: the fault-tolerant cellular 
automaton that we eventually build is a cellular automaton in the old sense. The more general 
media are only needed to make rules for all the structures that arise in simulations by a random 
process. 
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5. Main theorems in discrete time 

Some theorems starting with the present section wiU have names of the form "(FCA, [description])" 
where [description] gives a shorthand characterization of the properties of the automaton con- 
structed. 

5.1. Relaxation time and ergodicity. 
5.1.1. Ergodicity. Let 

§„ = §([-".«]nZ)'' 

be the set of configurations on the segment [— ti, n] H Z. Then we can view s e S„ as the vector 
(s_„, . . . , s„). For a measure v over configurations, let 

(5.1) i/(s) = i^{^ii) = s.,, i = -n,... , n}. 

For n = 0, we have the special case h'(s) = f^{C(0) — s}. A sequence of measures weakly converges 
to measure v if for for all n, for each s g §„ we have liuik i^kis) = i'(s). 

If (/z, rj) is a random trajectory of a probabilistic cellular automaton then let /i* be the distribution 
of the configuration ri{-,t). Then there is a linear operator P (called the Markov operator) determined 
by the transition function P(s,r) such that = Pfj,*. To show this it is enough to show how 

/i^(s) is determined by fjP. According to the definition of a trajectory, we have 

n 

(5.2) m'(s) = ^ n irj-i,r„r,+,))t,''{r) 

where the summation goes over all possible strings r G S„+i. 

We call a measure a over configurations invariant if Pa = a. It is well-known and easy to prove 
using standard tools (see a reference in [^) that each continuous linear operator over probability 
measures has an invariant measure. The invariant measures describe the possible limits (in any 
reasonable sense) of the distributions /i*. A probabilistic cellular automaton is called ergodic if 
it has only one invariant measure. It is called mixing if also for every possible measure ijP over 
configurations, /i* = P^jjP converges to one and the same invariant measure. Intuitively, if a process 
is mixing then the distributions /x* will look more and more like the invariant measure and contain 
less and less information about the initial distribution In other words, all information about the 
initial configuration will be eventually lost. 

Remark 5.1. Consider a discrete-time Markov process over a compact space. For an initial dis- 
tribution ^, let Pf_i be the measure for the whole process. Let us define the translation operators: 
{T'r]){x,t) — r]{x,t + l) for space-time configurations, {Tg){r]) — g{T'r]) for functions over space-time 
configurations. Traditionally, the process with initial distribution is called mixing if for each time 
t, each pair of continuous functions /, g where / is defined over the < t sigma-algebra, we have 

J fT'gdP^ - J fdP^J T'gdP^ ^ 0. 

as s ^ 00. It can be seen that a probabilistic cellular automaton is mixing in our sense if each 
process obtained from it by choosing some initial distribution is mixing in this traditional sense. <^ 

A noisy cellular automaton, whenever the set of sites is finite, is a finite Markov chain with all 
positive transition probabilities. This is mixing by a well-known elementary theorem of probability 
theory. If the set of sites is infinite then noisiness does not imply even ergodicity. 



Remark 5.2. No examples are known of noisy cellular automata over an infinite space that are 
ergodic and not mixing. <^ 
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Figure 8. The Toom rule's effect on a large triangular island 



The first example of a non-ergodic noisy cellular automaton was given by Toom. (See e.g. pTf.) 
One of the simplest deterministic cellular automata R given by Toom can be defined as follows. We 
start from a two-dimensional deterministic cellular automaton R with set of states {0, 1}, by the 
neighborhood H = {(0,0), (0, 1), (1,0)}. The transition function Trfj{xi,X2,X3) is the majority of 
xi,X2,X3. Thus, in a trajectory of R, to obtain the next state of a cell, take the majority among 
the states of its northern and eastern neighbors and itself. Toom showed that the rule R remembers 



a field (namely the whole state of the cell) in the sense defined in Subsection |2.5| and is hence 
non-ergodic. 

Remark 5.3. The notion of ergodicity was defined only for trajectories of i?^ that are also trajectories 



of some probabilistic cellular automaton R' (as defined in Subsection 2.3) such that R' C Rg, 
i.e. that all trajectories of R' are trajectories of R^- The difference between R^ and R' is that the 
local transition probabilities of a trajectory of R' are fixed and the same everywhere in space-time, 
while i?£ requires them only to be within some range. Toom's theorem implies that no such R' is 
ergodic. 

5.1.2. Relaxation time: a measure of information loss. Some readers, computer scientists in partic- 
ular, may want to know already now what relevance can results on inifinite cellular automata have 
on have on the possibilities of computation or information storage in finite systems. We will try 
to answer this question here. To stay in the context of the Toom rule, let us consider a (finite or 



infinite) two-dimensional space C. With an extension of the notation (5.1) to two dimensions (where 
s is now the array (s^ : —n < i,j < n)), for any fixed s, let us define the variation distance for fj, 
and v when the latter are restricted to 

d"(Ai,z.) = ^{|M(s)-Ks)| :sG§„}. 

Of course, < dnilJ-, i^) < 2. Distance 2 means that the two measures have disjoint support. Consider 
a set of sites 

where m can be finite or infinite. Suppose that for some m, the local transition matrix (for simplicity, 
with nearest-neighbor interaction) gives rise to a mixing Markov process with Markov operator Pm- 
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Let 

Dl,{t) = \/ dl,{Pl^i,Plu). 

Remark 5.4. It is easy to see that this is equal to 

r,s 

where j^s(s) = 1, i.e. I's is the measure concentrated on configuration s. <^ 
Notice that D'm{t) is monotonically decreasing in t since 

Pjn is mixing if and only if for each s, i/ we have P^v{s) — > iim{s)- By the weak compactness of the 
space of measures, this is equivalent to saying that 

limi?r„W=0 

holds for all n. The mixing of P„j implies that there is an integer function 

rm{n,S) 

with D'^{t) < 6 for all t > rm(n, (5). We will call rm{n,d) the relaxation time . This is obviously 
an increasing function of n (defined only for n < {m — l)/2) and a decreasing function of S. In the 
cases we are interested in, the order of magnitude of rm(ji,S) as a function of n does not change 
fast with the change of 6: Vmin, 0.1)) is not much larger than rm{n, 1.9). This means that once /im 
is not separated well from any P^v it soon becomes fairly close to all of them. Therefore we will 
generally consider 5 fixed. 

5.1.3. Relaxation time as a function of space size. If m < oo then the medium is always mixing. 
Assume now that the medium is also mixing for to = oo: we try to understand the implications of 
this fact for finite m. We have the following relation: 

Lemma 5.5. For all n < (m — l)/2, for all S with roc{n, 5) < (m ~ l)/2 — n we have 

(n, S) < roo{n, S). 

This means that if the medium is mixing for m ^ oo then increasing to in the case of m < oo 
does not increase the relaxation time significantly for any fixed n: in each segment of length n of 
any finite medium, information is being lost at least as fast as in the infinite medium. 

Proof. Take m,n,d satisfying the above conditions and let r = racin,d). Due to the monotonicity 
of D"^{t), it is enough to prove that D^{r) < D'^{r). Take a measure i' over configurations of Cm, 
this will give rise to some measure Voo over configurations of period to in Coo in a natural way, where 
z/qo is such that for all n < (to — l)/2 and all s G S„ we have i^oo(s) = i'(s). Then, r < (to — l)/2 — n 
implies 2ri + 1 + 2r < to, and therefore via (|]^) we have 

PXs)=P>„o(s). 

□ 

We found that mixing of Poo implies a kind of uniform forgetfulness in the sense that increasing 
the size to of the space does not help increasing the relaxation time beyond rao{n,5). 
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5.1.4. Forgetfulness: a variant of ergodicity. Non-ergodicity does not express quite adequately the 
losing of all information about the initial configuration in case of cellular automata, where namely 
the space-time is translation-symmetric (this was noticed by Charles Radin and Andrei Toom). 

Let ^0 be the configuration over the one-dimensional integer lattice that is in the even places 
and 1 in odd places, and that is 1 in the even places and in the odd places. Let fii be the 
measure concentrated on and let P be some linear operator obtained from a transition function. 
Suppose that the measures /'"/io converge to some measure vq. Then the measures P^^i converge 
to some measure ui. Even if vi is different from they differ only by a translation in space. If the 
translations of I'o are the only invariant measures of P then we would still say that in some sense, P 
loses all information about the initial configuration: we might say, it loses all "local" information. 
Indeed, a cell has no way of knowing whether it has even or odd coordinates. 

We can say that P is strongly not forgetful if it has two disjoint (weakly) closed translation- 
invariant sets of measures. Our non-ergodic examples all have this property. 

5.2. Information storage and computation in various dimensions. Let us be given an arbi- 
trary one-dimensional transition function Tr and the integers L. We define the three-dimensional 
transition function Tr' as follows. The interaction neighborhood is H x { — 1,0,1} with the neigh- 



borhood H defined in Subsection 5T above. The transition function Tr' says: in order to obtain 
your state at time t -I- 1, first apply the transition function R in each plane defined by fixing the 
third coordinate. Then, apply transition function Tr on each line obtained by fixing the first and 
second coordinates. (The papers [Q and |jll[ use a neighborhood instead of H that makes some 
proofs easier: the northern, south-eastern and south-western neighbors.) 

For an integer m, we define the space C — Za? x Z^. For a trajectory ^ of CA(Tr) on Z^r, we 
define the trajectory C, of CA{Ti') on C by 

(5.3) (:{i,],n,t)=£,{n,t). 

Thus, each cell of CA(Tr) is repeated vn? times, on a whole "plane" (a torus for finite to) in C. 

Then it is proved in earlier work that there are constants £Q,ci^di > such that the following 
holds. For all N,L, and to = ci log{NL), for any trajectory ^ of CA{Tr) over Zjv, if the trajectory 
C of CA{Tr') is defined by ( ^ ) then for any e < Eq, any trajectory (^,ry) of CA^{Tr') such that 
ry(-, 0) = (■(■, 0) we have for all t in [0, L] and all it; G C 

ti{7j{w,t)^C{w,t)} <die. 

This theorem says that in case of the medium CAs;{Tr') and the trajectories (/i, C), the probability 



of deviation can be uniformly bounded by die. The trajectories rj encode (by (5.3)) an arbitrary 
computation (e.g. a universal Turing machine), hence this theorem asserts the possibility of reliable 
computation in three-dimensional space. The coding is repetition 0(log (NL)) times, i.e. it depends 
on the size N ■ L oi the computation. The decoding is even simpler: if a plane of C represents a 
state s of a cell of CA{Tr) then each cell in this plane will be in state s with large probability. The 
simulation occurs in "real time" . 

The original proof of a slightly weaker version of this result used a sparsity technique borrowed 
from |llo| . In its current form, the theorem was proved in using an adaptation of the techniqe 
of i27|. 



Theorem 2.5 shows that one-dimensional noisy and strongly not forgetful probability operators 
exist. The proof of that theorem seems to require almost the whole complexity of the constructions 
of the present paper (though the continuous-time case adds some additional nuisance to each part 



of the proof). Once the basic structure (an amplifier, as asserted in Lemma 3.7) is in place, the 
simulations in it support arbitrary computational actions and allow the formulation of several other 
theorems. 
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The foUowing theorem asserts the possibiUty of storing much more information. For the finite 



version, recall the definition of Visible{k, N) in (4.23) 



Theorem 5.6 (FCA, 1-dim, storage, discrete time). There is 

• a transition function Tr' with a statespace having a field ; 

• a hierarchical code ^ with a shared field {F^) as in ( pl3| ) such that \ Visible{K{N),N)\ = N; 

• constants di,C2 > 0; 

such that for h2{n) — n^^/iogiogn^ all e > 0, t > 0, for any configuration g with states in S.F^, 
trajectory rj of CAi;{Tr') over C = Zat (for finite or infinite N) with initial configuration T{g; ^'), 
for all y in Visible{K (N), N) , we have 

Prob{ g{y) ^ f]{X{y; t).F^ } < die + te'^-'^^l 

The code can be chosen to have X(y; — y. 

The function e''2(^) shows the length of time for which a space of size N can hold information. 

A fault-tolerant computation is the trajectory of some perturbed medium CA^(Tr') simulating the 
trajectory of an arbitrary deterministic medium. The simulated transition function Tr is arbitrary, 
but we might as well choose it to be a fixed universal cellular automaton. Then CA{Tr') can be 
called a "universal fault-tolerant cellular automaton" . The exact form of the theorems formulating 
the possibility of reliable computation is somewhat arbitrary. It may seem more natural if the 
input of the computation is not given in the initial configuration but is being fed through some cell 
throughout the time of the computation. There are several choices also concerning simplicity, the 
tradeoffs of space- and time-redundancy, etc. These theorems should be considered therefore as just 



illustrations of the possible use of the amplifiers of Lemma 3.7. 

The basic idea is to implement a large but finite computation of Univ by choosing the "active 
level" k mentioned above so large that during the planned computation, it is unlikely that there 
will be any fault at all in the work of medium Mk- If the computation involves N cells and L steps 



then this means that we should have, say, NLet < 1/3. Theorem 5.8 realizes this idea. In it, there 
are still several choices: simpler encoding at the price of larger space redundancy, or more complex 
encoding at the proce of less space redundancy and more time redundancy. 

To formulate the theorems exactly, let Tr be any deterministic transition function with distin- 
guished fields Input, Output in the standard alphabet. We say that Tr has monotonic output if for 
all trajectories rj of CA{Ti) we have 

?7(x, t). Output :< r]{x, t + 1). Output. 

We call the transition function Tr, all of whose fields have even size (i.e. they are in the standard 
alphabet) together with some distinguished fields Input, Output, a standard computing transition 
function if it 



(a) is a separable transition function as defined in 2.2.1; 

(b) never changes Input; 

(c) has monotonic output; 

(d) never changes anything if one of the three arguments is vacant or if the middle argument has 
all ^'s in its input field, or if all three arguments have * in all their fields. 

Remark 5.7. The Input and Output fields here have nothing to do with the Inbuf and Outbuf fields 



introduced in 2.2.1. These are input and output of a certain large-scale computation, while Inbuf 



and Outbuf concern the step-for-step interaction of a cell with its neighbors. <C> 
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Let § be the set of states for Tr, and let g G (S.Input)^ be an "input configuration". Then the 
configuration 

(5.4) C = lnit[Q) 
with states in §, is constructed as follows. 

(a) For all x G Z, ((x). Input — g{x) and C,{x). Output is filled with *; 

(b) For all x G Z, all other fields of C,{x) are filled with 

(c) For all x ^ Z, we have ({x) = Vac. 

If is a configuration in where 1|§1| is even, i.e. elements of § can be viewed as strings in the 
standard alphabet, then we say that g is supported by interval / if g(x) = or vacant for all 

X ^ I. The support of g is defined to be the smallest interval supporting g and containing 0. Let 

(5.5) Suppig) 

be the support of g. For a space-time configuration C of a standard computing medium, let 

(5.6) SuppiC,t)^[JSupp{C{;u)). 

u<t 

Theorem 5.8 (FCA, 1-dim, discrete time, known bounds). Let Tr be a standard computing tran- 
sition function. There is 

• a transition function Tr' with a statespace having afield Output^; 

• constants di,CQ,ci > 0; 

• a hierarchical code with a shared field defined by ^ as in (4.13) 



such that for ha{t) — loglogt + co, hi{t) —t-{ci logt)^'°si°siogi^ j-^^ £^ ^ > q, for any trajectory 



C of Tr over Z with ^(•, 0) of the form Init(g) (see (5.4) ), satisfying 2|Supp(C, t)\ < s, for any finite 
K > ho{t) and possibly infinite N satisfying Supp{C,,t) C Visihle{K , N) , for any trajectory rj of 
CA^{Tr') over C — Z^r with initial configuration T{g; Kj'i'), for all y G Supp{(^,t), we have 

Prob{ C(2/, t).Output ^ ■q{X(y] K, ^!),hi{t)). Output^ } < die. 

The code can he chosen to have X{y; K,'^) = y and \ Visihle{K , N)\ = N. 

Thus, to find a computation result (^(y, u) of CA{Ti)^ from some input, we must do the following. 
Find a bound s on 2|Supp(C,t)| (the amount of space required by the computation if we put the 
origin somewhere into the input), then the level K > ho{t) on which the trajectory rj of the fault- 
tolerant automaton CA^iTr') carries out the computation to the desired stage u with the desired 
reliability until step t. Embed the input into a configuration g whose support contains 0. Encode g 
into an initial configuration r(C(-, 0); -fC) in a space whose size iV is large enough to have Supp{(, t) C 
Visihle{K, N). "Run" the trajectory rj until time hi{t), (it will be programmed to keep the active 
level at K), and finally look up the result in site X{y\K). Due to monotonic output, it is always 
possible to look up the result later. If you look at it too early the worst thing that can happen is 
that it will still be undefined. 

The function hi(t)/t measures time redundancy while X (y; ho{t)) / y measures space redundancy 
(which, as the theorem says, can be made constant). 

The above theorem strengthens the result of ||l^ by eliminating the need for the decoding of 
the computation result: we just have to look in the right place for it. In two dimensions such a 
construction was given in [ pl| . The time redundancy in that paper is 0{\og^^^ (NL)) where N,L 
are the upper bounds on the space and time requirements of the computation. We believe that this 
two-dimensional time redundancy can be reduced to \og^^^ (N L) . 
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5.2.1. Adaptive encoding. Theorem 5.S is not entirely satisfactory since its initial encoding (the 
active level) depends on the knowledge of a time and space bound on the whole computation. It 
is possible to remove this limitation. Instead of formulating the theorem, we just describe the idea 
of the construction. The initial encoding populates the space with cells of Mk for all k < K{N) 
where N is the space size. Chose a level fci of encoding for the computation input depending only 
on the size of the input. For an appropriately chosen (universal) increasing sequence (Lfe), after 
Lfe computation steps have been simulated, the active level will be raised from k to fc + 1. The 
parameters fci and (Lk) are chosen to guarantee with high probability that none of level k cells 
involved in the actual computation will fail before the level will be raised to fc + 1. 

5.2.2. Self- organization. In the just-discussed hierarchical fault-tolerant computations, the initial 
configuration is highly non-uniform since it encodes the hierarchy of simulations described above. If 
we forgo this initialization then the hierarchy must apparently be built up spontaneously. Initially, 
all sites in the media A/2, M^, . . . could be vacant but cells could emerge spontaneously, as lower-level 
cells organize themselves (with the help of some randomness) into colonies simulating higher-level 
cells. We call this feature self- organization, but postpone its formal definition a few sections later. 
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6. Media 



An abstract medium has been defined in 3.3 as essentially a set of stochastic processes called its 
trajectories. Let us narrow this category somewhat. The kind of abstract medium considered in this 
paper will simply be called a medium. We will also restrict the kind of simulations among media 
to mappings that by their very form will obviously be simulations: they will be called canonical 
simulations. 

6.1. Trajectories. Here, we will show in what form the set of trajectories of a medium will be 
given. 

6.1.1. Weak trajectories. In defining the conditions making up a medium we will make use of in- 
equalities of the form 

'Eg <b 



where g is an event function (as defined in Subsection 2.3), measurable in A{V) for some rectangle 
V. 

Example 6.1. Let V — [ai+,a2] x B < d = 0.2(a2 — cii). Suppose that we are given some 

r G S"^, s G §. Let g = 1 if there is a space-time point (a;, to) in V, and an e < d such that 
r]{x + iB, t) — ri for i = —1, 0, 1 during [to — e, to—] and r](x, to) = s. Thus, the event g — 1 marks 
a certain kind of transition in the window. <^ 

A medium will be given by many such conditions. Let us fix an arbitrary countable set T which 
will be called the set of condition types. A medium is given as 

(6.1) M = Med(§,C,5(-,-,-),6(-))- 

Here, S is the set of possible local states and C is the set of possible cell sites. The space C can also 
be omitted whenever it is clear from the context. For all types a G T and rational rectangles the 
medium assigns an event function 

9{a, V, 77) 

and a bound b{a) G [0, 1]. (We do not insist that g{a, V, •) be measurable in A{V) though usually 
this will be the case.) Therefore from now on, a local condition will be defined as a pair 

{a,V). 

The set S is determined by g implicitly. Also, generally local conditions will only be defined for 
small V, and in a translation-invariant way (in space, but only almost so in time since the time 
is distinguished). Therefore one and the same pair g,b can serve for all sufficiently large finite 
spaces C as well as the infinite space, just like with deterministic cellular automata, where the same 
transition function determines trajectories in finite spaces as well as the infinite space. 

When we are given media Mi, M2, . . . then gi{), bi() automatically refer to Mi. 

Two local conditions are disjoint when their rectangles are. A random space-time configuration 
r] will be called a weak trajectory if for each time m > 0, for each set of disjoint local conditions 

(Oj, Vj)j<Ar 

with miiii TTtVi > u, if h is an event function over Au then 

(6.2) E{h Yl g{a„ V,,r^)) <Eh\{ b{ai). 

i i 

This says that we can multiply the probabilities of violating local conditions in disjoint windows, 
i.e. that these violations happen "independently" . 
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6.1.2. Trajectories. The media we are going to be interested in have these three types of condition. 

(1) One type recognizes in y a certain kind of value called "damage" , and has 6 = e for some small 

(2) One type recognizes if there is no damage in V and the transition does not occur according to 
a certain transition function. This has 6 = 0. 

(3) One type corresponds to some "coin tossing" event, and has 6 = 0.5 + e' for a suitable e' . 

If we need coin-tossing (we need it only for the results on self-organization) then we must also consider 
certain sets of conditions that are chosen randomly. A random local condition is a (measurable) 
function [a,V{iiii)) assigning a local condition to each space-time configuration 77. We recall the 
definition of stopping times from the theory of stochastic processes, specialized to our present needs. 
A real random variable a with values in [0, 00—] is a stopping time derived from rj if for each t the 
event a < t \s m At- Let Acr be the algebra of events A with the property that Ar\{cr <t} € At- 
Assume that for each t in [0, 00—] there is random variable V^{rf) measurable in At with values that 
are rational rectangles with 

^tV\ri) C [t,<x\ 

and such that with probability 1 the function V^{r]) is right semi-continuous in t. This will be called 
a rectangle process derived from rj- We mention the following well-known fact from the theory or 
stochastic processes: 

Theorem 6.2 (see ^l| ). V is a random variable measurable in Aa- 

We will omit the words "derived from if when they are clear from the context. A random- stopping 
local condition is given by a triple 

(6.3) {a,V\Ti),a) 

where a is a type, V*'{ri) is a rectangle process and cr is a stopping time, both derived from rj. This 
condition is supposed to express the bound 

E5(a,y-,77) <5(a). 

For some finite set N of indices, let us now be given a sequence 

(6.4) (a„y/,(T,),eJv 

of random-stopping local conditions. It is is called a system of disjoint random local conditions if 
(for almost all 77) all the rectangles V^"^' {r]) are disjoint, and there is a constant upper bound on the 
ai and a constant segment containing all tTsV"/. A random space-time configuration 77 will be called 



a trajectory if for each set of disjoint random local conditions as in (3.4) with ai > u, if ft, is an event 
function over Au then 

(6.5) E{hl[g{i,i^)) <^hl[b{a,) 

i i 

where 

(6.6) g{i,rj)=g{a,,V''^^'^\rj),rj). 

This also says that we can multiply the probabilities of violating local conditions in disjoint windows, 
but requires that this be even true if the windows are chosen by random stopping rules. 
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Figure 9. A system of disjoint random local conditions 



6.1.3. Examples of simple media. 

Example 6.3. The first example will show how a PCA fits into this framework. Let us be given a 
probabilistic cellular automaton PCA(V , B, T). For each state vector r e S"^ and state s, let us form 
type Q(s,r).This condition type says that the transition rules are obeyed at times of the form nT. 
Then for cell x, time t of the form nT with n > 0, let 

g{a{s,v), {x}x[t-T+,t], iq) = {r]{x + iB,t -T) = n {i = il{x,t) = s}, 

6(a(s,r))=P(s,r). 

For all cells x and rational times u we define some new type (} = P{x, u)hy b{(3{x, u)) = and 

g{f3{x,u), {{x,u)}, T]) = {r]{x,u) ^ t]{x,T\uIT\) }. 

This condition says that r\{x^ u) = ri{x,T\u/T\) with probability 1. For all other combinations a, V 
we set g{a, V, rf) = 0. It is easy to see that the trajectories of PCA{P, B, T) are trajectories of the 
medium defined this way. <C> 



Exam,ple 6.4. The new conditions introduced below are "loosened-up" versions of the above ones 
since it is not always desirable to be as restrictive. Let us be given a probabilistic cellular automaton 
PCA(P, B,T). For each set of states E, and state vector r = (r_i,ro,ri) let 

P(i;,r) = ^P(g,r). 
Let K be the set of functions : — > 2^ such that 



(6.8) K{r) c{s^ro:P{s,r)>0}. 

For a space-time trajectory rj and a if S K let us write 

K{x, t, Tj) = K{r]{x - B, t),r]{x, t), tj{x + iB, t)). 
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For all K', K" e K with K' D K" (i.e. K'{y) D K"{r) for all r), let 

) y P(A-(r),r) 

be the least upper bound, over r, on the conditional probability of getting into the set K"{r) provided 
we get into the set K"[r). For each pair of such functions K' D K" let us form the type 

a = a{K',K"). 

For each cell x and u < v oi the form nT, let us form the rectangle V — {x} x [u+,v]. For each 
such pair a, V let g{a, V,ri) = 1 if there is a first t € [u+, v] with ri(x, t) G K'{x, t — rj) and in it, 
we have t]{x, t) G K"{x, t-T.rf). Also, let b(a) = c{K' , K"). <} 

Proposition 6.5. If M is the medium defined by the conditions above then PCA(P, B,T) C M. 

Proof sketch. The statement follows by a standard argument from the strong Markov property of 
the Markov process PCA(B, B, T) (see □ 

Example 6.6. Consider a con tinu ous-time cellular automaton CCA(R, _B). For any functions 
K, K', K" e K as in Example [ej, let 

R(if,r) = ^R(q,r), 

CiK' 

^^""''^ ^ y R(i^'(r),r)- 

We define each a{K' , K"), V ~ {x} x and g{a,V,ri) as before, but now for all rational 

u < V. (} 

Theorem 6.7. Every trajectory of CCA(R, B) is a trajectory of the medium defined in the above 
example. 



Proof sketch. As in |2.4.l| , for a small S > 0, let Ms = PCA{P,B,S) with P(s,r) = (5R(s,r) when 
s 7^ rg and 1 — SJ^s'^ro ^^('^'j'") otherwise. Then a transition to the limit J ^ in Proposition 6.5 
completes the proof. The transition uses the fact that trajectories of M{6) converge in distribution 
to trajectories of CCA(R, B). □ 



Remark 6.8. Of course in both Proposition x5 and in Theorem 6/7, instead of the first time with rj 
switching into K' we could have asked about the second time, or the third time, etc. <) 

6.2. Canonical simulations. 

6.2.1. Definition of canonical simulations. Let us introduce a special kind of simulation, that estab- 
lishes certain relations between the local conditions of the media and thus makes it easier to prove it 
is a simulation. Let Mi , M2 be media and let (p* be a mapping from the space-time configurations of 
Ml into the evolutions of M2: we will write rj* = ip*{r]). We say that Lp* is the map for a canonical 
simulation if the following holds. Let (a, V) be any local condition for M2. Then there is a system 
of random local conditions 

(6.9) ( /3(a, z, j), W'{a, i, j, V, ij),T{a, V, 77) ) 
with i — I, . . . , n(a), j — I, . . . , k{a, i), 

(6.10) mfntV <T{a,i,j,V,7j), 
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such that for each fixed i, the subsystem containing elements for that i forms a system of disjoint 
random local conditions with W*{a,i, j,V,ri) C V, and, defining 

W{a, I, J, V, = (a, z, j, V, 77), 

(6.11) 

gi{a,i,j,V,r]) = gi{(3{a,i, j),W{a,i, j,V,'n),i]), 

we have 

g2{a,V,r]*) <^Ylgi{a,i,j,V,'n), 

(6.12) ' ^ 

62(«)>En^i(/3(a,z,j)). 

* i 

A simulation map will be called a deterministic canonical simulation when the system in (|6.9| ) 
is deterministic: neither r(a, i, j, y, 77) nor W*{a,i, j,V,ri) depends on 77. Under the condition men- 
tioned in the definition of weak trajectories (see (|6.2[ )), we will be able to provide deterministic 
canonical simulations and this is why weak trajectories are sufficient. Canonical simulations are 
"cooked up" to make the following theorem true. 

Theorem 6.9 (Canonical Simulation). Ifip* is a canonical simulation map then for each trajectory 
rj of Ml , the expression ip* (77) defines a trajectory of M2 ■ 

Proof. Let 77 be a trajectory of Mi and (a;, VJ*, <Ji)i^m a system of disjoint random local conditions 
in A/2- Let u be some time such that ai > u for each I, and let h be an event function measurable 
in Au- Wc must show 

E{hYlg2{l,V*)) <'EhY[b2{ai), 

I I 

where .92(^1 C) is defined as in (|6^). By the assumption, for each l,V there is a system of disjoint 
random local conditions 

{(3{ai,i,j),W*{ai,i,j,V,T]),T{ai,i, j, V, 77) ) 

for 7 = 1,... , n{ai) j = 1, . . . , k{ai, i) with the properties defined for canonical simulations. Since 
V has a countable range, we obtain measurable functions even if we substitute VJ"^' (7/*) into V here. 
Let 

and recall ( |6.11 ). 

1. W = W'^{ai,i,j,Vi,ri) is a rectangle process and T{ai,i, j,Vi,ri) is a stopping time. 

= t = t 

Proof. It is easy to see that ntW C [t,oo] and that the process W is upper semicontinuous 

almost everywhere. Let us show that for each t it is measurable in At. We write 

V 

Now ( 6.10| ) and r(a;, 7, j, V, 77) < t implies infTTtV^ < t. On the other hand by the definition of 
random-stopping local conditions, V = Vi{— Vi'^') impfies cr; < inf TTtF. Hence we can write the 
above expression as 

y {W\ai,i,j,V,rj)^W AV^Vi'") Aai<t. 



By Theorem 3.2 



.13) {ai<tAV^Vi'''}eAt 
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By the definition of the rectangle process W^, also 

{W\ai,i,j,V,r,) = W} €At, 

hence indeed {W — W } At- Finally, we have to show 

{T{ai,i,j,Vi,Tj) <t} eAt. 

As above, 

T{ai,iJ,Vi,r]) <t<^\/ {T{ai,iJ,V,j]) <tAV = Vi) 

V 

which again can be written as 

V ( T{ai ,i,j,V,r,)<tAV = Vp )Aai<t. 

V 

We have { r(Q!i, i, j, y, 77) < & At since T{ai,i, j,V,ri) is a stopping time; combining this 
w ith (6T^ ) completes the proof. 
By ( |6.12| ) we have 



(6.14) 
and 



gi{ai,i,j,Vi,ri). 



I I i j i(-) l,j 



where the last sum is over all functions i(-) with G [l,n(ai)]. Similarly, by (6.14) 

^{■) 1,3 

Therefore it is sufficient to show 

ij 1,3 
Let us fix a function Recall that by ( |6.1l| ), 

gi{a,i,j,Vi,T]) = gi{/3{a,i,j),W{ai,i,j,Vi,i]),T]). 

For any l,i, the rectangles W{ai,i, j,Vi,r]) are all disjoint and belong to Vi. Also the sets Vi are 
disjoint, therefore all rectangles W{ai,i{l),j, Vi, 77) are disjoint. Recall that also by (3.11), 



therefore the system 



{P{ai,i{l),j), W{a, i, j, VJ, 77), T(a, i, j, VJ , ry) ) 



taken for all is a system of disjoint random local conditions with u < T(a, z, j, Vi, rj), and hence 
the proof is finished by the trajectory property of 77. □ 
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6.2.2. Frequency of switching times. Adding certain conditions does not really change some media. 
Let us call a canonical simulation injective if the mapping ip* is the identity, and the functions 
62(a), 52(0^, ■) differ from 61, 51 only in that they are defined for some additional types a, too. 
Proving a certain probability bound for the behavior of r/i within some window does not create a 
canonical simulation yet: the bound must be formulated in such a way that this type of probability 
bound can be multiplied with itself and all the other bounds over a system of disjoint windows (and 
these windows can even be chosen via stopping times) . 

Let us illustrate canonical simulations on the the following example. The simulating medium Afi 



is the CCA of Example S.6. The conditions to add impose lower and upper bounds on the frequency 



of switching times of AIi. Namely, for some function K E K.\et 

a^i{K) = /\R{K(v),v), 

r 

a^{K)^\/R{K{r),r). 

r 

Let us call a time u of the space-time configuration rj a K -switching-time of a cell x if for all t < u 
sufficiently close to u we have •q{x,u) e K{x,t,ri), i.e. we have just jumped into the target set 
determined by K{-). 

For some rational constant D > 0, integers I > and j = —1,1 let 

(6.15) g{a{K,D,k,i), {x} x [u+,u^D], ry) 

be the event function of type a{K^ D, k,j) for j ~ —1 [j = 1] for the event that site x has at most 
[at least] k times during [u+, u + D], that are i^-switching times. 

Proposition 6.10. For each transition rate matrix R and function K E K. there is a function 
K{e) > such that for all k > 0, defining, with Oj — aj{K) 

b{a{K, D, (oj + je)D, j j) = e~^^^'>" unless j = 1 and (ai + e)!? = 1, 

(6.16) b{a{K,D,l,l)) = e~'^''^^° AaiD 

(and defining b{a{K, D,k, j)) = 1 for all other choices of k) the trajectories of CCACR, B) are also 
trajectories of the medium obtained by adding all these conditions. 

The proof proceeds by first estimating the event in question by sums of products of single switching 
events and using then a standard large-deviation estimate. 

6.3. Primitive variable-period media. We say that T, > is a dwell period lower bound of a 
space-time configuration 77 if no dwell period of r] is shorter than T,. A continuous-time cellular 
automaton has no such lower bound. 

In the amplifier Mq, Mi, . . . we will construct eventually on top of a continuous-time probabilistic 
cellular automaton Mq, all abstract media Mi, M2, . . . will have a dwell period lower bound, only Mq 
will not have one. Mi will be a so-called primitive variable-period medium: these can be considered 
the continuous-time extension of the notion of an e-perturbation of a deterministic cellular automaton 
with coin-tossing. On the other hand, M2, M3, . . . will only fit into a more general framework (non- 
adjacent cells). 

Let us thus proceed to the definition of medium 

Prim_var(S, C, Tr, B, T,,T\ e). 

The set § of states is implicit in the transition function therefore from now on, it will be omitted. 
We have dwell period lower and upper bounds T, < T* and a failure probability bound £ > 0. The 
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local state, as in Example p.4| , is a record with two fields, Det and Rand where Rand consists of a 
single bit which is, say, always the first bit. 

To simplify the upper-bounding of dwell periods, we assume that the transition function has the 
property 

(6.17) Tr(r_i,ro,ri).Det 7^ ro.Det. 

For a space-time configuration rj, site x and rational number a > let ci = (Ji{x, a, rj), and a2 be 
the first and second switching times t > a of i] in x. Let us list the different types of local condition. 

(a) Conditions of type a{dw_pJb) imposing T, as a dwell period lower bound. We have 
b{a{dw_pjb)) = e, and 

g[a{dw_pJb), {x} x [a, a + T*], rj) 

is the event function saying that ri{x,t) has two switching times closer than T, to each other 
during [a+, a + T,-]. 

(b) Conditions of type a{dw-P-ub) imposing T* as a dwell period upper bound. We have 
b{a{dw_p_ub)) = e, and 

g[a{dw_p_ub), {x} x [a+,a + T'], rj) 

is the event function saying that r](x,t) has no switching times in [a-|-,a -I- T']. 

(c) Conditions of type a(comput) saying that the new value of the Det field at a switching time 
obeys the transition function. We have b{a{comput)) ~ e, and 

g[a{comput), {x} x [a, a + 2T'], rj) 

= {r]{x,a2).Det ^ {n{r),x,t,B).Det : t G [01,02 -T./2] }}. 



Here we used the notation (3.2). 
(d) Conditions of type a{coin,j) for j = 0,1 saying that the new value of the Rand field at a 
switching time is obtained nearly by a fresh coin-toss: 

b[a{com, j)) = 0.5 + e, 
g{^a{coin,i), {x} x [a,a-t-r*], 77) ~ {rj{x , 02) -Rand = j} . 

Condition (^) says that unless an exceptional event occurred, whenever a state transition takes 
place at the end of a dwell period [oi , 172] it occurs according to a transition made on the basis of 
the states of the three neighbor cells and the random bit at some time in the observation interval 
[oi, 02 — T,/2]. Since the observation interval does not include the times too close to (T2, information 
cannot propagate arbitrarily fast. 

Example 6.11. All trajectories of the ordinary deterministic cellular automaton CA(Tr, B,T) are 
also trajectories of Prim_var{Tr, B, T, T, 0). <0 

Theorem 6.12 (Simulation by CCA). For medium 

M = Prim_var{Tr, 1, T„ 1, e) 



with ( 6.17 ) and T, < 1 there is a noisy transition rate R(s, r) over some state space §' and a function 



s' ^ § such that T:{r]{x,t)) is a trajectory of M for each trajectory rj of CCA(R, 1) 



The proof is a straightforward application of Proposition 6.1C 
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6.4. Main theorems (continuous time). The first publication showing the possibility of reliable 
computation with a continuous-time medium (in two dimenstions) is [Q. Here, we formulate the 
new results for variable-period one-dimensional information storage and computation. The following 



theorems are just like Theorems 2.5, 5.(: and 5.8 except that they have a variable-period medium 



Prim_var(Tr', 1, T,, 1, e) with T, — 0.5 in place of the perturbed discrete-time automaton CAg{Tr') 
See the Corollary below for continuous-time interacting particle systems. 

Theorem 6.13 (1-dim non-ergodicity, variable-period). 

There is a one- dimensional transition function such that its primitive variable-period perturbations 
remember a field. 

Theorem 6.14 (FCA, 1-dim, storage, variable-period). There is 

• a transition function Tr; 

• a hierarchical code ^' with a shared field (F'") as in ( 4.13D ; 

• constants di, C2 > 0; 

such that for h2{n) — all e > 0, t > 0, for T, = 0.5, any configuration g with states 

in S.F^ , trajectory rj of Prim_var{Tr, 1, T,, 1, e), over C — Zjv (for finite or infinite N ) with initial 
configuration r(p; ^), for all for all y in Visible {K (N) , N), we have 

Prob{ giy) ^ 7^{X{y- ■^),t).F^ } < die + te'^^^^^. 
The code can be chosen to have X{y; K, '5) = y. 

The main result of the present paper is the following one, asserting the existence of a one- 
dimensional fault-tolerant variable-period cellular automaton: 

The orem 6.15 (FCA, 1-dim, variable period, known bounds). This theorem says the same as The- 
orem 



5.t , with again Prim_var(Tr', 1, 0.5, 1, e), replacing CAglTr') 



For the interpretation of this theorem, see the comment after Theorem 5.8. The construction 
indicated in 5. 2.1] has a similar variable-period counterpart. Theorem 3.12 implies the following. 



Corollary 6.16 (Interacting particle system construction). In Theorems 6.15, 6. 14, 6.1 



replace Tr' with a rate matrix R and Prim_var(Tr', 1, T,, 1, e), with a CCA with rates coming from 



an arbitrary e -perturbation o/R. This proves, in particular, Theorem 2.t. 



6.5. Self-organization. If the input information is a single symbol then an (infinite or large finite) 
hierarchical initial configuration in the above reliable simulation theorems is less natural than one 
consisting of the repetition of this symbol. 

Theorem 6.17 (FCA, 1-dim, storage, variable-period, self-organizing). For each m there is 

• a transition function Tr with state space § having a field Output; 

• a mapping 7 : {0, 1}™ — *■ S; 

• constants C\,C2 > 0, 

such that for h2{n) — n'^s / log log " ^ e > 0, t > 0, for any string s of length m in the standard 
alphabet, any trajectory 77 of Prim_var{Tr, 1,T,, l,e) over C = Z^r (for finite or infinite N) having 
initial configuration ri{y, 0) — 7(5) for all y, the following holds for all x: 

Prob{ s ^ T]{x, t).Output }/ci < e + te'*^^^) + (t^e) A (t""'^ + N-"-'^). 

Though it is not seen from the theorem, hierarchical organization actually emerges here: the 
configuration rj(-,t) will consist of islands of varying levels of "organization" and the level will be 
generally growing with t. For infinite N we can ignore the terms involving N (since they converge 
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to 0), obtaining e + {t^e) At^" **. The term t^^-^ decreases rather slowly. This term is essentially the 
upper bound on the probability that organization does not grow to the required level until time t. 

To formulate a theorem about self-organizing computation, assume that the input is again, as in 
the storage case, just a binary string s fitting into a single cell. (If you want a larger input, we would 
have to encode it into a higher-order cell and repeat that cell.) However, the simulated computation 
may last forever producing ever-growing monotonic output. Where to read the currently available 
output when the input has no distinguished origin? This must depend on the length of the output. 
We can expect that there is a constant co such that for some sequences of lengths B[ < B'2 < ■ ■ ■ , 
Bi < B2 < ■ • ■ , ii the computation output is a string of length > B'j^ by some time t, then by all 
times exceeding a certain time hi (t) , its first segment of length i?^ "can be located somewhere" in 
every interval /fc(x) = -I- cq-B^]. Since this requires the short starting segments of the output 
to occur more densely than the longer ones, we store the output on infinitely many different tracks 
simultaneously. As for the infinitely many fields in Proposition 4.1C, let Y + Xk{y) denote the 
position where the y-th symbol of the A:-th track is stored, if the storage starts at position Y . (In our 
construction, Y will be the site of a cell of the simulated medium Mk whose body fits into Ik{x).) 

The theorem on self-organizing computation says that there is a constant do f^nd a function 
Yk{-,-,-) such that for every k,x,y,t, in every interval Ik{x), with Y = Yfe(a;, /ii(t), 77), with proba- 
bility > 1 — doe we have 

ay,t).Output ^ rj{Y + Xkiy),hi{t)).Fl. 
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7. Some simulations 

7.1. Simulating a cellular automaton by a variable-period medium. The random nature of 
the switching times of a variable-period medium is a tame kind of nondeterminism; any deterministic 
cellular automaton can be simulated by a variable-period medium. To prove this we first introduce 
an auxiliary concept. Let us define the totally asynchronous cellular automaton 

ACA{Tr) = ACA(Tr, 1,1) 

associated with transition function Tr as follows: ry is a trajectory if for all x, t we have either 
ri{x, t -I- 1) = r]{x, t) or the usual 

i]{x, < + 1) = Tr{ri{x — 1, t),rj{x, t),r]{x + 1, t)). 

A site X is free in a configuration if Tr(^(a:: — 1), £,{x), £^{x + 1)) ^ £,{x). The set of free sites will be 
denoted by i(^). For a space configuration ^ and a set E of sites, let us define the new configuration 
Tri^E) by 



Tr{^,E)ix) 



fTr(e(x-l),C(x),e(x + l)) ifx€E 

otherwise. 



Now we can express the condition that 77 is a trajectory of ACA(Tr) by saying that for every t there 
is a set U with 

(7.1) r,{;t + l)^Tr{7j{;t),U). 
Let the update set 

(7.2) U{t,r^) 

be the set of sites x with ri{x, t + I) ^ ?;(a;, t). The initial configuration and the update sets U{t, rj) 
determine 77. For any set A, let us use the indicator function 



x{x,A) = 



1 if x e A, 
otherwise. 



For given 77 we define the function t(.t, t) = t(x, t, 77) as follows: 
(7 3) 

Tix,t + 1) ^ Tix,t) + xix,U{t,r^)). 

We can call t(x, t) the effective age of x in 77 at time t: this is the number of effective updatings 
that X underwent until time t. Given a transition function Tr and an initial configuration ^, we say 
that the function has invariant histories if there is a function ({x,u) = ({x,u,^) such that for all 
trajectories ri{x, t) of ACA{Ti) with 7;(-, 0) = ^ we have 

(7.4) ri{x,t) = (^{x,T{x,t,r]),0. 

This means that after eliminating repetitions, the sequence C,{x, 1), (^(x, 2), . . . of values that a site 
X will go through during some trajectory, does not depend on the update sets, only on the initial 
configuration (except that the sequence may be finite if there is only a finite number of successful 
updates). The update sets influence only the delays in going through this sequence. Let 

Ti{i,E,F) = Tr(Tr(e,i?),F). 

We call a transition function Tr commutative if for all configurations ^ and all distinct pairs x, 7/ G 
L(^) we have Tr(^, {x}, {y}) — Tr{(^, {7/}, {x}). The paper ||l2] proves the theorem that if a transition 



function is commutative then it has invariant histories. In Theorem 7.1 below, we will give a simple 
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example of a universal commutative transition function. For that example, the theorem can be 
proved much easier. 

Below, we will use the smallest absolute- value remainders 



with respect to a positive integer to > 2, defined by the requirement —to/2 < b amod to < to/2. 

Theorem 7.1 (Commutative Simulation). Let Tr2 be an arbitrary transition function with state 
space §2- Then there is a commutative transition function Tri with state space Eii ~ S2 x R (for 
an appropriate finite set R) with the following property. Each state s S §1 can be represented as 
(s.FjS.G) where s.F G S2, s.G G R. Let ^2 be an arbitrary configuration 0/ §2 md let ^1 be a 
configuration of §1 such that for all x we have ^i{x).F — S,2ix), ^i(x).G = - • - G R. Then for 
the trajectory 771 of CA(Tri), with initial configuration ^1, the function rii{x,t).F is a trajectory of 
CA{Tr2). Moreover, in rji, the state of each cell changes in each step. 

In other words, the function Tri behaves in its field F just like the arbitrary transition function 
Tr2, but it also supports asynchronous updating. 

Proof. Let J7 > 2 be a positive integer and 



be three fields of the states of Si, where F = Cur, G = {Prev,Age). The field Age represents 
numbers mod U. It will be used to keep track of the time of the simulated cells mod U, while Prev 
holds the value of Cur for the previous value of Age. 

Let us define s' — Tri{s-i, Sq, si). If there is a j G { — 1, 1} such that {sj.Age — so-Age) amod U < 
(i.e. some neighbor lags behind) then s' = sq i.e. there is no effect. Otherwise, let rg — sg.Cur, 
and for j — —1, 1, let rj be equal to Sj.Cur if Sj.Age — so-Age, and Sj.Prev otherwise. 



Thus, we use the Cur and Prev fields of the neighbors according to their meaning and update the 
three fields according to their meaning. It is easy to check that this transition function simulates 
Tr2 in the Cur field if we start it by putting into all other fields. 

Let us check that Tri is commutative. If two neighbors a;, a; + 1 are both allowed to update then 
neither of them is behind the other modulo U, hence they both have the same Age field. Suppose 
that X updates before a; + 1. In this case, x will use the Cur field of a; + 1 for updating and put 
its own Cur field into Prev. Next, since now x is "ahead" according to Age, cell a; + 1 will use the 
Prev field of x for updating: this was the Cur field of x before. Therefore the effect of consecutive 
updating is the same as that of simultaneous updating. □ 

The commutative medium of the above proof will be called the marching soldiers scheme since its 
handling of the Age field reminds one of a chain of soldiers marching ahead in which two neighbors 
do not want to be separated by more than one step. 



(7.5) 



b amod to 



Cur, Prev, Age 



s'.Cur 
s' .Prev 
s'.Age 



Tr2{r-i,ro,ri), 
SQ.Cur, 

SQ.Age + 1 mod U. 



Remark 7.2. In typical cases of asynchronous computation, there are more efficient ways to build a 
commutative transition function than to store the whole previous state in the Prev field. Indeed, 
the transition function typically has a bandwidth (see 2.2.1) smaller than ||§||. (} 
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Corollary 7.3 (Variable-period simulation). For every deterministic transition function T12 over 
some state-space §2, there is a set of states Si, a transition function Tri 

over §1, and a code that for any values T.^ < T*i, is a simulation of CA{Tr2) by 
Prim.var{Tri, 1, T.i, T'l, 0). 



Proof. Let Tri be the commutative transition function given by Theorem 7.1, with the fields F, G. 
Let ^2 be an arbitrary configuration of S2 and let ^1 be a configuration of Si defined in the statement 
of the same theorem. Let 771 be a trajectory of Prim_var(Tri, 1, T.^, T'l, 0), with the starting 
configuration ^1. 

An update set U{t,rii) similar to ([7.2|) can be defined now for the trajectory 7]i as follows: x is 



in U(t,r]i) iff t is a switching time of 771. Similarly, T(x,i,?7i) can be defined as in (7.3): 

t(x,0) = 0, 

T{x,t) = T{x,t-) +x{x,U{t,'qi)). 

With these, let us define 

a{x, s,£,) = f\{t: t{x, t, 77') = s }, 
V2{x,s,^) = f]i{x,(T{x, s)).F 
By the cited theorem, 772 is a trajectory of CA(Tr2). □ 



The simulation in this theorem is not a local one in the sense defined in Subsection 3.6 since it 
changes the time scale. For an analysis of such simulations, see [|]. 

7.2. Functions defined by programs. Let us recall the definition of a standard computing tran- 



sition function as introduced in Subsection 5.2, and let us call a cellular automaton with such a 
transition function a standard computing medium. For a standard computing transition function 
Tr, integers s and t and string X consider a trajectory 77 of CA{Tr) over the rectangle [0, s] x [0, t] 
with an initial configuration in which 77(0, 0) = 77(5, 0) = # • • • further X is on the input track on 
[1, s — 1] (padded with *'s to fill up [1, s — 1]), *'s on the output track and O's on the other tracks. 
This defines a trajectory 77 since the #'s on the input field in cells and s imply that the cells 
outside the interval [0, s] will have no effect. 

Assume that at time t, the Output track has no * on [1, s — 1]. Then the output never changes 
anymore (it is monotonic in standard computing media). The string w on the Output track on 
[1, s — 1] will be called the result of the computation, denoted by 

w = Tr{X; s, t). 
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X on Input 
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Figure 11. Definition of trans{X ; s , t) 



7.2.1. Efficient universality. We say that the standard computing transition function Trg is effi- 
ciently universal if for every standard computing transition function Tr, there is a string prog and 
constant c such that for all strings X and positive integers s,t we have 

Tr{X; s, t) = Tro{prog U X; cs, c(s + t)) 

whenever the left-hand side is defined. In other words, Tro can simulate the computation of any other 
standard computing transition Tr if we prepend a "program of Tr" . The space and time needed for 
this simulation can only be a constant factor greater than those of the simulated computation. 

Theorem 7.4. There are efficiently universal standard computing transition functions. 



Sketch of the proof. This theorem can be proven similarly to Theorem |3.5| . The main addition to 
the construction is that before Tro starts a block simulation of Tr the input will be distributed 
bit-by-bit to the colonies simulating the cells of Tr. At the end, the output will be collected from 
these colonies. □ 

We fix an efficiently universal standard computing transition function 
(7.6) Univ 



for the definition of various string functions. In view of Theorem 7.1, we can (and will) also assume 



that Univ is commutative. For a string function f{X) defined over some domain E we will say that 
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prog is a program for / with time complexity bound t and space complexity bound s over E if we 
have 

Univiprog U X; s, t) = f{X). 

It is often convenient to define a finite function (like a transition function of some medium, or a 
code) by its program (for Univ) rather than its transition table, since for many interesting functions, 
the program can be given in a more concise form. 

7.3. The rule language. This subsection defines a language for describing a transition function 
that is more convenient than giving a complete transition table. A transition function will generally 
be defined by a set of rules which impose some conditions on it. In case some rules are contradictory 
and no precedence is stated between them then rules given later override the ones given earlier. 
Rules are sometimes associated with certain fields. The lowest-priority rule referring to a field is 
called the default. 

Some rules will not be applicable directly, but will rather be used by other rules: these will be 
called subrules. A subrule R is just like other rule, except that, typically, there is some other rule 
saying in which time interval of the work period to invoke R. 

Our use of the rule language will not be completely formal but it will be evident how to make it 
formal. In a semi-formal description of a rule, x refers to the cell to which it is applied. This is only 
a convenient shorthand that can be replaced with references to fields of the cell and its neighbors. 
We will write 

i^j{x) =x + jB 

for the site j steps from x. In the condition as well as the action of the rule, a field F(x) will simply 
be written as F. We will often write 

For example, for field Addr, instead of writing "if r].Addr{x) = and Addr(i?j(x)) = 1" we will 
write "if Addr and Addr' = 1" . 

The simplest possible rule is an assignment of the form F := g where F is a field and (/ is a 
function of some fields, e.g. F := + H^. Another possible rule is the conditional: 
cond{ 
? Ci 
! Ai 

? Ck 
! Afe 
[?! Ak+il 
} 

Here, Ci,C2,... are conditions: inequalities and equations in terms of the states of 'i?j(x), and 
Ai, . . . , Ak+i arc possible actions each of which can be another rule. We can use logical connectives, 
even (finite) quantifiers in the conditions. The above rule says that if condition Ci is satisfied then 
the action Ai must be performed. Else if C2 holds then action A2 must be performed, etc. Finally, 
and optionally, (this is meant by "[,]") if all conditions fail then we may require a default action 
Afe+i to be performed. The symbols ?, ! , ? ! abbreviate condition, action and default respectively. 
An alternative way of writing would be 

if Ci { 
then Ai 
else if C2 



60 



PETER GACS 



then A2 

else Ak+i 
} 

Remark 7.5. We prefer the first form (borrowed from the programming language LISP) only in order 
to put all conditions formally on the same level. <0 

The rule 

Ri II R2 

is a combination of two rules. It says that both rules must be carried out simultaneously. In case 
the two rules are just two assignments we will sometimes omit the sign || . The rule 

asks for carrying out the rules Ri and R2 consecutively. This informal notation can be understood 
in terms of a field Age which we are always going to have and can be replaced with a conditional. 
E.g., whenever we write 

Retrieve ; 
Eval 

then this can be replaced, using appropriate constants U, with 
cond { 

7 ti< Age < t2 
! Retrieve 
7 t2< Age < ts 
! Eval 

} 

We will also use the construct 

for i = a to 6 ( some rule referring to i) . 
The construct 

repeat k times (• • • ) 
is a special case. 

A subrule can have parameters (arguments). Subrule P{i) with parameter i can be viewed as a 
different rule for each possible value of i. An example could be a simple subrule Read-Mail{j) which 
for j € { — 1, 1} has the form 

Mail := MaiP 

for the field Mail. We will also have functions: these are defined by the information available in the 
arguments of the transition function, and can always be considered a shorthand notation: e.g., we 
could denote 

f{i) = Addr + Age\ 

The construct 
let k = . . . 

can be understood as notational shorthand (though could also be incorporated into the rule lan- 
guage). 



Example 7.6. Here is a description of the transition function given in the proof of Theorem 7.1 
(Asynchronous Simulation) . 
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rule March { 

for {-1,0,1}, { 

let r{j) = Cur' if Age-' =Age, and Prev-' otherwise; } 
cond { 

? Vj e {-1, 1} {Age^ - Age) amod U >0 

! { 

Prcv := Cur; 

Cur:=Tr2(r(-l),r(0),r(l)) || 

Age := Age + 1 mod U 

}}} 



Wc will also have a field Addr. A colony is a segment of cells with addresses growing from to 
Q — 1 modulo some constant Q. 

A location is a pair (F, I) where F is a field, and / is an interval of addresses in the colony. We 
will denote it as 

F{I). 

As an element of tlie language, of course, a location is simply a triple consisting of the field, and 
the numbers determining the endpoints of the interval. A location is meant to specify a sequence of 
symbols on track F. If a shorthand name is given a location, this will generally begin with a minus 
sign, like 

Jnfo. 

Remark 7.7. Occasionally, we may treat the union of two or three locations as one. It is easy to 
generalize the rules dealing with locations to this case. 

Let us be given a string S consisting of a constant number of runs of the same symbol. For 
example, 0™!" has one run of O's and a run of I's. Let us also be given a location loc. Then a rule 
Write{S,loc), writing the string S to the location loc, can be written as a conditional rule, as long 
as there are runs of O's and I's: 

subrule Write(0'"l",F([a,a + m + n- 1])){ 
cond { 

? a< Addr <Q A{a + m) 
! F := 

? a + m< Addr <Q A{a + m + n) 
! F := 1 

}} 

The rule language can contain some definitions of names for constant strings of symbols, of the form 
(7.7) Paramo = sq, Parami = Si, Param2 = S2, ■ ■ ■ ■ 

where Si are some strings. The names Pararui are allowed to be used in the rules. Let us even agree 
that the whole program is nothing but a sequence of such definitions, where the first string sq is the 
sequence of all the rules. In this case, using the name Paramo in the rules is a kind of self-reference, 
but its interpretation is clear since it denotes a definite string. 

Theorem 7.8 (Rule Language). There is a string Interpr and an integer (interpr_coe) such that 
the following holds. If string P is a description of a transition rule Tr over state set S in the above 
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language (along with the necessary parameters) then the machine Univ defined in ( |7.6| ) computes 
Tr(r-_i, To, ri) (possibly padded with * 's) from 

Interpr U P U r_i U ro U ri 

within computation time (interpr_coe)(|P| + 1)^||§|| and space (interpr_coe)(|P| + ||§|| + 1). 

Sketch of proof. A detailed proof would be tedious but routine. Essentially, each line of a rule 
program is some comparison involving some fields: substrings of a state argument r^. We have (|P| + 
1) squared in the time upper bound since we may have to look up some parameter repeatedly. □ 

From now on, by a rule program Trans-prog of the transition function Tr, we will understand 
some string to be interpreted by Interpr. 

7.4. A basic block simulation. The simulation described here is used just to demonstrate the 
use of the notation and to introduce some elements of the later construction in a simple setting. Let 
a transition function Tr2 be given. We want to define a cellular automaton Mi = CA{Tri), whose 
trajectories simulate the trajectories of M2 — CA{Tr2,Q,U), with appropriate Q,U. Of course, 
there is a trivial simulation, when Mi = M2, but a more general scheme will be set up here. This 
simulation is not one of the typical simulations by a universal medium: the cell-size of Mi depends 



on M2 as in Example 3.1. The construction will be summarized in Theorem 7.13 below 



In the construction, we introduce some more notation that can be considered a shorthand in writ- 



ing the rules but can also be incorporated into the rule language (without invalidating Theorem 7.8 
(Rule Language)). 

7.4.1. Overall structure. The transition function Tr2 : ^ ^2 to be simulated is given by a rule 
program Trans-prog2. To perform a simulated state transition of M2, a colony of Mi must do the 
following: 

Retrieve: Retrieve the states of the represented neighbor cells from the neighbor colonies; 

Evaluate: Compute the new state using Tr2; 

Update: Replace the old represented state with the new one. 



The field Addr holds a number between and Q — 1, as discussed in Subsection 3.1, The default 
operation is to keep this field unchanged. The time steps within a work period of a colony are 
numbered consecutively from to [/ — 1. The field Age holds a number between and U — 1 
intended to be equal to this number. The default operation is to increase this by 1 modulo U. These 
default operations will not be overridden in the simple, fault-free simulations. Using these fields, 
each cell knows its role at the current stage of computation. 

On the track Info, each colony holds a binary string of length ||S2||. For a string S £ E>f , the 
decoded value f*{S) is obtained by taking this binary string. The encoding will be defined later. 
The default operation on the information field is to leave it unchanged. It will be overridden only 
in the last, updating step. For simplicity, let us take \Info\ = 2, i.e. the Info track contains only 
symbols from the standard alphabet. The field Cpt will be used much of the time like the cells of 
a standard computing medium, so its size is the capacity lUruVj = ||S[/,i,v|| of the fixed efficiently 
universal standard computing medium. It has subfields Input, Output. The field Cpt. Input is under 
the control of the rule Retrieve, while rest of Cpt is under the control of the rule Eval. 

The whole program can be summarized as 

Retrieve ; 

Eval; 

Update 
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7.4.2. Mail movement. Let x be the base of the current colony under consideration. For i = —1, 0, 1, 
we will indicate how to write a subrule 

Copy{i, loci,loc2) 

that copies, from the colony with base x — iQ, the location 7oci to location I0C2 of the current colony. 

Remark 7.9. In the present section, for convenience, we assume that the tracks in loci and I0C2 have 
the same width. Similarly, we assume that the mail track is at least as wide than any of the tracks 
that it serves during copying. This assumption might no t be v alid if the transition function we are 
designing is separable with some small bandwidth (as in 2.2.1 ) since the mail field must be part of 
Buf. 

Copying can be organized also without the assumption. Suppose e.g. that the mail track and ioc2 
is narrow, and the track of loci is wider by a factor fci than these. Then loci can be considered to 
be the union of fci locations of smaller width and the copying of loci into J0C2 will need ki separate 
copying operations. <0> 

With the help of this rule, we will have, with appropriate locations -Retrievedm, 

subrule Retrieve { 

for m e {-1,0, 1} do { 

Copy{m, -Info, JRetrievedm) 
}} 

here _Info is the location on the Info track containing the represented string. For the rule Copy we 
use a framework a little more general than what would be needed here, with a variable-time version 
in mind. Let 

(7.8) JVbJnd = {-1,0, 1}, 

(7.9) MailJnd = {-1.1,-0.1,0.1,1.1}. 

In colony with base x, track 

Mailk, k e MailJnd. 

is for passing information to colony with base x + sign(fc) [|fc|J _B, in direction sign(fc). Field Mailk 
has subficlds 

Fromaddr, Fromnb, Info, Status. 
For simplicity, let \Mailk.Info\ = \Info\. The field Status can have the symbolic values 

Normal, Undef. 

The default value of Mailk. Status is Normal. When Mailk. Status = Undef then the other subfields 
of Mailk will play no role therefore we will, informally, also write Mailk — Undef. For adjacent 
cells X, y with j = y — x, and their colonies x* and y* (defined from their Addr field) we define the 
predicate 

fO iix* = y*, 

Edgej{x) = \ 1 if a; and y are endcells of two adjacent colonies, 
[00 otherwise. 

The mail track Mailk of cell x will cooperate, in direction j, with mail track peer{k,j) where 

peer{k,j) =k- jEdge^ix) 
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Figure 12. Copy{-l,Fi{[ai,ai + n-l]),F2{[a2,a2 + n-l])) 



if the latter is in MailJnd. For j = — sign(fc), we define 

MaiLto^eceive{k) = MaiJ^^^^^ . 

as the mail to be received into Mailk provided peer{k,j) e MailJnd and Undef otherwise. The 
one-step rule Move-mail gets mail from the neighbor cell: 

subrule Move-mail { 

pfor k in MailJnd do { 
Mailk ■= MaiLto-receive{k) 
}} 

Here, pfor is a "parallel for": the rule is carried out simultaneously for all k (i.e. for all fields 
indexed by fc). A cell will typically copy the information to be sent into Mailk -Info and at the same 
time, its own address into Mailk -Fromaddr, and — sign(fc)[[fc|J into Mailk-Fromnb. In the copy rule 
here, i G { — 1,0,1} refers to the direction of the sending colony as seen from the receiving colony. 
The next argument is the location of origin in the sending colony, the one following is the location 
to receive the information in the receiving colony. 

subrule Copy{i, Fi{[ai, ai + n - 1]), F2{[a2, a2 + n — 1])) { 
let = l.li if i 7^ and 0.1 sign(a2 — ai) otherwise 
let I = 0.1 sign(fc) 
Mailk -Fromaddr := Addr 
II Mailk-Fromnb := —i 
II Mailk -Info := Fi; 
repeat 2Q times { 
Move_maiJ 
II cond { 

? Addr e [02, 02 + n - 1] and 

Addr — a2 — Maik-Fromaddr — ai and —i = Maili.Fromnb 
! F2 := Maili-Info 

}}} 

Remark 7.10 (Indirectly given locations). The description Fi([ai,a2]) of a location can fit into a 
single field Fq of some cell. We will allow that some argument locj of the rule Copy{i, ioci, J0C2) is 
given by a field Fq this way. This will be done only if z = 0, i.e. the copying proceeds within one 
colony. When executing the copy rule, it will be assumed that field Fq of each cell of the colony 
contains the same information locj. Therefore the rule can be written just as above, except that 
some of its parameters are read now from Fq. <C> 
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7.4.3. The evaluation rule. The subrule Eval controls the track Cpt \ Cpt. Input. The first steps 
of Eval write the interpreter, and the program of the transition function to be simulated into 
appropriate locations on the Cpt. Input track, followed by writing the program Trans-prog2 found 
as Parami after the present rules (see ([7.7D): 

Write {Interpr, Jnterpr) ; 
(1) Write {Parami, _Prog) . 

(If Tri = Ti-2 (self-simulation) is desired then write Write{Paraino, -P^og) in part (P above, and 
Parami, after it.) Then Param2, . . . will be written after Parami. Next, the rule Eval executes 

Copy{0, _Retrieved„i, -Arg^) 

for m g NbJnd where -Arg^ are locations for the arguments of the transition function on the 
Cpt. Input track. For initialization, the rule writes * ■ • • * to the Output track and the rest of the cells 
(including the endcells) of the Cpt. Input track, and O's to the track Cpt\{Cpt.InputUCpt. Output). 
Then for a sufficient number of steps, the transition function Univ will be applied to the Cpt track. 
According to Theorem (Rule Language), the computation finishes in 

(interpr_coe) ( I Trans_prog2 1 + 1)^||§2|| 
steps, so this number of iterations is sufficient. 



Remark 7.11. If the transition function to be simulated is separable as defined in 2.2.1 then what has 



been computed is not Tr2 but the auxiliary function Tr^2"' ■ this, the meaning of field [Sw, Aw — 1] 
is an address, of a target field to be changed in the simulated cell. The evaluation rule will broadcast 
this number into the field Target_addr of each cell of the colony (it fits into a single field) to be used 
eventually in Update. <C> 



The subrule 



copies the track Cpt. Output into track Jnfo. 



Update 



Remark 7.12. When the transition rule Tr2 to be simulated is separable (as in 2.2.1) then the 
appropriate locations of Cpt. Output must be copied to the appropriate locations of Info. Some of 
this copying uses locations defined in an indirect way as in Remark 7.1C| , using the field Target_addr 



mentioned in Remark 7.11. 



7.4.4. Summary in a theorem. The encoding tp^ of a cell state v of M2 into a colony of Mi is defined 
as follows. The string v is written into Jnfo. The Cpt track and the mail tracks are set to all I's. 
Each Age field is set to 0. For all i, the Addr field of cell i of the colony is set to i. 

The theorem below states the existence of the above simulation. As a condition of this theorem, 
the parameters 

(7.10) Trans.prog2,|l§i||,l|S2||,Q,C/ 

will be restricted by the following inequalities. 
Cell Capacity Lovirer Bound: 

IISill >Ci[logC/] + I l/niv I +C2 
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where Ci , C2 can be easily computed from the following consideration. What we really need is 
||Si|| > |Addr| + \ Age\ + \Info\ + \Mail\ + \ Cpt\ where the following choices can be made: 

\Info\ = 2, 

\Maili\ = \Maih.Fromaddr\ + \Maik.Fromnb\ + \Maik.In{o\ = [logQ] +2 + 2, 

\Cpt\ = \Univ\, 

\Addr\ = \Age\ = \logU]. 

Colony Size Lower Bound: 

Q > (interpr_coe)(||S2|| + \ Trans-prog2 \ +logJ7 + 1). 

With the field sizes as agreed above, this provides sufficient space in the colony for information 
storage and computation. 
Work Period Lower Bound: 

U >3Q + (interpr_coe)(|Traiis_prog-2| + 1)^||S2||. 
With the field sizes above, this allows sufficient time for the above program to be carried out. 
It is not difficult to find parameters satisfying the above inequalities since logQ <C Q. 

Theorem 7.13 (Basic Block Simiilation). There are strings Sim_prog(), Siin_progi such that the 
following holds. If Trans-prog2, 11^2 1|, Q , U satisfy the above inequalities then 

Paramo = Siin_progi, Par ami = Trans-prog2, 
Param2 = Params = ||S2||, 

Paranu = Q, Param^ = U 

is a rule program of a transition function Tri such that CA{Tri) has a block simulation 
of CA[Tr2,Q,U). Also, if ||§i||, ||§2||j Q, satisfy the above inequalities with §i = S2 and 
Trans-prog2 = Sini_progQ then 

Paramo = Sim_progQ, Parami = empty string, 

Param2 = ||Si||, Params = ||Si||, 

Param4 = Q, Param^ = U 

is a rule program of a transition function Tri such that CA{Tri) has a block simulation of 
CA(Tri,0,[/). 

The given construction is essentially the proof. Its complete formalization would yield Sim_progQ 
and Sim_progi explicitly. 
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8. Robust media 

This section defines a special type of one-dimensional medium called robust. From now on, 
when we talk about a medium with no qualification we will always mean a robust medium. For 
help in understanding the definition can be compared to the primitive variable-period media in 
Subsection ^.3| . The media defined in the present section have some features distinguishing them 
from cellular automata: non-lattice cells, damage, communication among non-nearest neighbors. 



The need for non-lattice cells was explained in Subsection 4.3 



8.1. Damage. Robust media have a special set of states Bad C §. For a space-time configuration 
f] we define the damage set 

Damage{rj) — { {x, t) : j]{x, t) G Bad }. 

For a space configuration, the damage is defined similarly. The interpretation of the set Damage{ri) 
is that when {x,t) £ Damage{r]) then in the neighborhood of {x,t), we will not be able to make 
any predictions of 77, i.e. in some sense, ry behaves completely "lawlessly" there. When 77(2;, t) € Bad 
then it is irrelevant whether we regard site x occupied by a cell or not. For a cell x a time interval 
/ is damage-free if 77(2;, •) is not in Bad during /. 

Remark 8.1. In all cellular media concerned with our results we could require Damage{r]) = 0. The 
damage concept is necessary only in a trajectory 772 of a medium M2 obtained by simulation from a 
trajectory rji of some medium Mi. Such a simulation typically requires some structure in 771. When 
noise breaks down this structure the predictability of 772 suffers and this will be signalled by the 
occurrence of damage in 7^2 . 

However, for convenience, we will define damage even in the media used on the lowest level, as a 
violation of a certain transition rule. <C> 

According to the general definition of a medium, the set of trajectories will be defined by a pair 
b{-),g{-, ■, ■) where b{a) give the probability bound belonging to type a and g{a,W,ri) is the event 
whose probability is bounded. We formulate these in terms of "properties". For a medium with 
cellsize B, let 

(8.1) V = [-B/4, B/4-] X [-T./4+, 0]. 

The Computation Property constrains the kinds of events that can occur under the condition that 
the damage does not intersect certain larger rectangles. One condition type a{restor) will be called 
the Restoration Property: it bounds the probability of occurrence of damage in the middle of some 
window {x, t) + 2V. The Restoration Property, which depends on a new parameter e, says that the 
probability of the damage is small: even if other damage occurs at the beginning of a window it will 
be eliminated with high probability. 

Condition 8.2 (Restoration Property). Let b{a{restor)) = e. Further, for any rational pair {x,t) let 

g{^a{restor), {x,t) + 21^,77) 
be the event function for the event that there is damage in ri{{x,t) + V). {> 

The Restoration axiom says that damage, i.e. the occasional obstacle to applying the Computation 
Property, disappears soon after its occurrence: namely, it has very small conditional probability of 
occurrence in the inner half of any rectangle (x, t) + 2V. 

The axiom will be automatically satisfied on the lowest level by the property of the medium that 
the transition rule is only violated with small probability. 
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Figure 14. Cause for damage in point (x.t) of the simulated trajectory 



Remark 8.3. In the model of [ |ri| , the restoration axiom is weaker. There, damage does not neces- 
sarily disappear in a short time but if it is contained in a certain kind of triangle at the beginning of 
the window then, with high probability, it is contained in a smaller triangle of the same kind later 
in the window. <C> 

Damage helps us present a hierarchical construction as a sequence of simulations. When a large 
burst of faults destroys the fabric of these nested simulations, then ry'^+i cannot be explained just 
in terms of the t]'' from which it is decoded. The damage set will cover those lower-level details 
of the mess that we are not supposed to see as well as the mechanism of its removal. Damage 
points can be viewed as holes in the fabric of the lawful parts of a trajectory Consider a simulation 
fj* = between two robust media. Let us define the damage set of rf to be used in almost 

all such simulations. As usual, let us mark by * the corresponding quantities in the simulated 
medium when some simulation is fixed. We will define rf{x,t) G Bad iff Damage{rj) contains at 
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least two points m, v such that u + 2V, v + 2V are disjoint and even u + 3V, v + 3V are contained in 
ix,t)+AV* + (B*/2,0). 

The value of s, provided it is not in Bad, will be essentially defined by a block code cp as 

However, there will be some look-back in time for stabilization, i.e. the simulation will not be 
memoryless. The goal of the damage definition is to relieve us of the need of simulating a transition 
function if there is too much damage in the big rectangle {x, t) + AV* + {B* /2, 0). 

Lemma 8.4 (Simulation Damage Probability Bound). Let M , M* be media with parameters e, e* 
whose local condition system is defined by the Restoration Property. Let a simulation be defined 
between them which assigns damage in M* according to the above damage map. If 

e* > 25{{B*/B){T,*/T,)ef 



then, is a deterministic canonical simulation map as defined in 6.2.1 



This lemma says that small bursts of damage that are not too close to each other behave in the 
medium M as if they were independent, therefore the probability of the occurrence of two such 
bursts can be estimated by times the number of such pairs in a rectangle V* . 



Proof. It is enough to show an expression of the form (6.12) for local conditions of type restor in 
M*. Let xi,ti be the sizes of the space and time projections of V. We define the following lattice 
of points: 

V = { {ixi,jti) ■.i,j el}. 

Then the rectangles v + V for w G V form a partition of the space-time. For each u in space-time, let 
v{u) be the v G V with u € v + V. Let 77 be a trajectory of M. It is enough to consider the rectangle 
2V* . If Da.mage{r]*) intersects V* then there are some points ui,U2 in Damage{rf) such that Ui + 2V 
are disjoint and even Ui + iV are in 5^* -I- (i?*/2,0). Then it is easy to see that even v{ui) -\- 2V 
are disjoint from each other and are contained in hV* + {B* /2,{)). Let n be the number of pairs 
vi, V2 in V with this property. We found n pairs of rectangles Uij + 2V {i = 1, . . . ,n, j = 1,2) such 
that Ui^i + 2V n Ui^2 + 2V — 0, and that if Damage{rj*) intersects V* then there is an i such that 
Daniage{ri) intersects Uij + V for j = 1, 2. We found 

g* {restor, 2V*,T]*) < ^ J]^ 5 ( restor, m.j +2V,t]). 

i j 

To complete the proof, observe that 



5Z n ^(^^s^o^) — ^^"^ 



Since counting shows n < 25((i3*/i3)(T,*/T,))^ and the assumption of the lemma says 
25((i3*/i?)(r,*/T,))^e^ < £* both conditions of a deterministic canonical simulation are satis- 
fied. □ 



8.2. Computation. 
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8.2.1. Neighborhood structure. Before giving the Computation Property, let us introduce some de- 
tails of the structure of a robust medium. Cells sometimes have to be erased (killed) in a trajectory 
since cells created by the damage may not be aligned with the original ones. At other times, the 
creation of a live cell at a vacant site will be required. Most of the trajectory requirements of a 
robust medium will be expressed by a transition function Tr, desribed later. Killing and creation 
may be indicated by some special values of this function. 

Although we permit non-lattice configurations we could still use a transition function that restricts 
the new state of a cell only as a function of adjacent cells. It is, however, convenient to allow nearby 
cells to see each other regardless of whether they are on the same lattice or not. We extend the 



notation i^j{x) introduced in Subsection 7^. Let us fix 77. For a vacant site x at time t there is at 
most one cell at distance greater than but less than B to the right of x: if it exists and there is no 
damage between it and x then we denote it by tJq.sI^;, t, rj). Otherwise, 1^0.5(2^1 v) is undefined. The 
notation i^-o.s is defined analogously. We will omit ry, and sometimes even from the arguments 
when it is obvious from the context. By convention, whenever dj{x,t^rj) is undefined then let 

r]{'dj{x,t,r]),t) = Vac. 

For a (nonvacant) cell x there is at most one cell at distance greater than B but smaller than 2B to 
the right of x: if it exists and there is no damage between it and x then we denote it by 

^i.six). 

Otherwise, this value is undefined. The corresponding cell to the left is denoted by i?.! 5(2;). The 
medium depends on a parameter r called the reach, determining the maxiumum distance of the 
neighbors that a cell is allowed to depend on directly. (We will only use two possible reaches: r = 1 
and r = 3.) Let 

(8.2) NbJndr = {-L5, 1.5} U {-r, -I- 1, . . . , r - 1, r}. 

A function u: NbJndr — > S will be called an assignment. For an assignment u, we will write Uj for 
the value of u on j. The transition function in a robust medium has the form Tr(u) where u runs 



over assignments. In analogy with (2_^), let 

TF(77, x, t) = Tr(u) 

where u is the assignment defined by Uj — rj(dj, t) with -dj — ^j{x, t, rf) except that if is defined 
for some noninteger j, then then u\j = Vac for all A > 1. Thus, when a closest neighbor 'dj is not 
adjacent then the transition function does not depend on any farther neighbors in that direction. 77 
will be omitted from Ti{rj,x,t) when it is obvious from the context. 

8.2.2. Properties of the transition function. A robust medium will be denoted by 

J?ob(Tr('"\B,T.,r*,£,£',r). 

The set § of states is implicit in the transition function. Similarly, the subset Bad of states will be 
defined implicity by the transition function: this will be the states such that if any argument of the 
transition function is in this set, the function value is not defined. The first bit of the state of a cell 
will be called Rand and we write Dct ~ All \ Rand. There is also a subfield Color C Det defined 
implicitly by the transition function: this is, say, the set of those first bits of Det that turn to 
when Tr is applied to the vector with all Vac values. (Color plays role in self-organization.) 

The transition function, a mapping from assignments to §, describes the goal for the deterministic 
part s.Det of the value s of the state after transition. We will see that the field Color will not 



necessarily obey the transition function. We will assume that Tr is separable in the sense of [2.2.1 



with some bandwidth w. Let the fields Rand and Color be part of Inbuf: thus, they can be seen by 
a neighbor cell in a single step. The separability requirement also defines a predicate legaf^' (m, v). 
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Our transition functions in robust media will also have the following properties: 
Condition 8.5 (Time Marking), li s Vac then legal{s,s) — 0. (} 



This condition is similar to (6.17) and says that a nonvacant cell always tries to change its state. 
It is necessary for nontrivial media with a strict dwell period upper bound. 

Let us call a pair (j, v) G { — 1, 1} x S, a potential creator of state s ^ Vac if s = Tr(u) provided 
Mi = w for i = j and Vac otherwise. In a space-time configuration rj, at a switching time a of cell 
X when r]{x,t) turns from vacant to non- vacant, we call y = 1?^ (x, ct— ) a creator of x for time a if 
{j,rj{dj{x,cf—))) is a potential creator of 77(0::, tr) and further ri{x,t) — Vac, r]{y,t) = r]{y,a—) for 
te [c7 -T,/2+,a-]. 

Condition 8.6 (Creation). 

Let u be an assignment such that uq = Vac, Tr{u).Det Vac.Det and in which there is a j in 
{— 1, 1} with Uj Vac. Then there is a j G { — 1, 1} such {j, Uj) is a potential creator of Tr(u). (} 

This condition says that if, for some ti < t2, a vacant cell x comes to life at time t2 due to 
"observation" at some time ti when there is a neighbor ■dj{x,ti) nearby then an adjacent neighbor 
at time ti can be made completely responsible for the state of the new cell at time the state 
would be the same even if all other sites in the neighborhood were vacant. 

If u is the neighborhood state assignment vector consisting of all vacant states then let Newborn ~ 
Tr(u) be the canonical newborn state. A state s is a newborn state if 

s.{Det \ Color) — Newborn.{Det \ Color). 

Note that a newborn state is allowed to be somewhat less determined than others: this property 
will be used in self-organization. 

Condition 8.7 (Cling to Life), li Tr{ri,x,t) = Vac then r]{-dj{x,t),t) / Vac for some j G { — 1.5,1.5}. 

Thus a cell will only be erased if it may disturb a close non-aligned neighbor. 
8.2.3. The computation axiom. The condition called the Computation Property has a form similar 



to the definition of primitive variable-period media in Subsection 3.2 . For a space-time configuration 
r], cell X and rational number a > let 

CTl, 0-2, CTo 

be defined as follows. (Ti,a2 are the first two switching times of x after a but defined only if 
r]{x, (Ti) 7^ Vac. On the other hand, (Tq is the first switching time of x after a + T* but defined only 
if ri{x, a + T') = Vac. Whenever we have an event function g{a, W, rf) in whose definition 172 occurs, 
this function is always understood to have value if 172 is not defined (similarly with (Tq)- Let 

Wq{x, a) = {x] X [a - T./2+, a + 2T*], 

Wi{x,a) = [x - [r + l)B,x + {r + 2)B-] x [a - T, /2+,a + 2T'], 
fj{x, a, rj) — the event function for { Damagciji) n Wj{x, a) = } {j = 0, 1). 

Condition 8.8 (Computation Property). This axiom consists of several condition types. For each 
type a used in this axiom except a{rand, j), we have 6(a) = 0, i.e. the corresponding events 
g{o:{), W,r]) are simply prohibited in the trajectory. On the other hand, 

b{a{rand,j)) = 0.5 + e' (j = 0, 1). 
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Figure 15. To the computation axiom. The large rectangle is Wi{x,a). The 

rectangle of the same height but width at point x is 14^0(2;, a). The small rectangle 
between times cti and a2 is the cell work period under consideration. Its lower part 
is the observation period. 



(a) This condition requires coin-tossing, for j = 0, 1: 

g{a{ra.nd,j), Wa{x,a), rj) = Ja{x,a,7]){j]{x,a2).Rand = j}. 

(b) This prohibits dwell periods shorter than T, or longer than T* . Let 

g[dw_pj)d, Wo(a;,a), r;) = fo(x,a,ri)h{dw_p_bd,x,a,rj) 

where h{dw-P-bd,x,a) = 1 if 77 has a dwell period shorter than T, in Wo{x,a) or has a dwell 
period longer than T* there (and, as always, otherwise). 

(c) This says that whenever Wo{x, a) is damage- free the transition at 0-2 is a legal one. Let 

g{legaLcomp, Wo{x,a), 77) = /o(.t, a, r?)(l - Jegai(r?(a;, C72-), 7?(a;, 0-2))). 

(d) This says that whenever Wo{x,a) is damage- free the transition in (Tq is a legal one. Let 

g{ legaLbirth, Wo{x,a), rj) = /o(a;, a, 77)(1 - ]egaJ(?7(a;, ctq-), r7(x, cro))). 

(e) This says that whenever Wx{x, a) is damage-free the transition function applies, at least as much 
as it can, based on observation at a certain time during the observation period ("atomicity"). 
Let 

g[trans, Wi{x,a), rj) = f\{x,a,rj)h{trans,x,a,rf) 
where h{trans, x, a,r]) = 1 unless there is a e [(Ti+, a2 — T,/2] with 

r]{x,a2).Det = Tr{r],x,t').Det. 

(f) This says that if x always has a neighbor during the whole interval of interest before its birth 
then it has a creator. Let 

g[creator, Wi{x,a), rj) = fi{x,a,r])h{creator, x,a,r]) 
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where h{creator, x, a, rj) = 1 unless either x has a creator for tune ctq or there is a G [a+jdo] 
such that ■q{-dj{x,t'),t') = Vac for aU j e {-1,1}. 

(g) This says that if x does not have a creator accounting for its birth then it is a newborn with 
vacant neighbors at some time shortly before birth. Let 

g{newborn, Wi{x,a), rj^ — fi{x,a,ii)h{newborn,x,a,ri) 

where h{newborn, x, a, 77) = 1 if a: has no creator for time uo, and either r](x, ctq) is not newborn 
state or there is no t in [(Jq — T,/2, ctq— ] with ri{'dj{x, t),t) = Vac for all j G NbJnd. 

(h) This says that x cannot stay vacant if it has would-be creators for a long time and has no 
neighbor blocking creation. Let 

g(^no_birth, Wi{x,a), 77) — fi{x,a,r])h{no_birth,x,a,r]) 

where h{no_birth,x,a,ri) = 1 if 

(a) ri{x, t) = Vac for aU t e [a+, a + 2T']; 

(b) for each t G [a+, a + 2T'] there is a j G { — 1, 1} such that (j, dj{x, t)) is a potential creator 
of some (non- vacant) state; 

(c) there is no {y,t) with Q<\y-x\<B,t'^ [a+, a + 2T'\, r]{y,t) ^ Vac; 



Example 8.9 (Special cases). To obtain a constant-period medium as a special case of robust media 
set T, = T*, r = 1. To obtain a deterministic cellular automaton, set also e = 0, require that 
the space-time configurations have empty damage and that the transition function does not give a 
vacant value. 

The connection between primitive variable-period media and robust media will be set up using a 
trivial simulation. Let Mi ~ Prim_var{Tr,B,T,,T*,e). We define a simple simulation $* by this 
medium of the robust medium 

M2 = Rob{Tr,B,T,,T',e,e, 1). 

M2 has almost the same state set as Mi except that it also has at least one extra state, making the 
set Bad nonempty. We give the definition of s = rj*{x,t) — ^*{rj){x,t) for each 77. If x is not an 
integer multiple of B then s = Vac. Else s — rj{x^t) unless 77 violates the Computation Property of 
M2 at x with (72 = t: in that case, it is in Bad. It can be verified that this is indeed a simulation. () 



2.4. Error- correction. The error-correction property can be defined here just as in 3.6.2| , replacing 



Tj with r,,j. Then a lemma analogous to Lemma 3.7 can be formulated, and a proof for Theorem 6.13 



can be provided that is analogous to the proof of Theorem |2.5. 
8.3. Simulating a medium with a larger reach. 

Theorem 8.10 (Reach Extension). For all r and all T12 with reach r, for all T, < T' let A = 
\T'/T,-\, 

C/ = 64A((A + l)r-M)(r + 2). 

There is a set §1, a one-to-one mapping s i— > s* from §2 to §1 giving rise to the code Lp^{S^){x) = 
{£,{x))<,, functions Tri() of reach 1, and for all B,s,e', a decoding <&* such that ($*,(p*) is a simu- 
lation of 

M2 = RobiTr2,B,UT„UT',Us,URs',r) 

by 

Ml = RobiTri,B,T„T%e,e',l). 
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The basic idea is of the proof is to replicate the state of a whole reach-r neighborhood of a cell 
of M2 in a single cell of Mi . It will take then a work period size U that is at least proportional to r 
for a cell of Mi to simulate a step of a cell of M2 since the cell must learn about neighbors r steps 
away. In fact, due to asyiichrony, the work period will be significantly larger. Some complication is 
due to the need to achieve the "atomicity" property, namely that a single observation time can be 
assigned to a transition of Mi even though the observation occurs over an extended period of time. 
The solution for this is to make sure that the observed values coming from the neighbor cells do not 
come from near the end of the work period of those cells. 
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9. Amplifiers 

In the present section, when we talk about media without quahfication wc understand robust 
media. 



9.1. Amplifier frames. The eventual goal is to find an amplifier (Mfe, $fc) with a fast decreasing 
sequence Ek and we also want to simulate a computation with transition function Univ and 
damage probability bound Ek- In this subsection, we impose some conditions on the parameters of 
a sequence (Mk) of media that are sufficient for the existence of an amplifier. These conditions are 
similar to the inequalities in the conditions of Theorem 7.13. 



9.1.1. The rider fields. All robust media Mk have, by definition, the fields Det and Rand whose 
behavior is governed by the computation and restoration properties. Only Det will be subdivided 
into subfields: therefore subfields rj.Det.F will be denoted simply as ry.F, without danger of confusion. 
In the media M'^ we discuss further, there will be a shared field called Rider^ with guard field 



Guard'^ (see Subsection 4.2). The useful computation of Mk will be carried out on the "active level" 



k, on the track Rider'' . The simulations will have sufficient freedom in the rule governing Rider'' on 
the active level to regain the universality lost by having no "program" for the whole simulation (as 
discussed in Subsection 4.3). We will say that in a robust medium M, the transition function Tr, 
with state set §, is combined from the rider transition function Rd-trans and simulation transition 
function Sim_trans with fields Rider and Guard, if the value s — Tr(r) is defined in the following 
way. If Sim_trans{r) = Vac then s = Vac. Otherwise, all fields of s disjoint from s. (Rider U Guard) 
are defined by SimArans(r) . Further 



(9.1) 



s. Rider = 



s. Guard 



Rd-trans{r) .Rider if r^. Guard — 0, 

Sim_trans{r) .Rider otherwise, 

Rd_trans{r). Guard if tq. Guard > 0, 

Sini_trans(r). Guard otherwise. 



Thus, the rider field is controlled by its own transition function only when the simulation does not 
command to eliminate the cell, and when we are on the active level according to the guard field. 
The guard field will be controlled by its own transition function in case we are not below the active 
level (where the simulation transition function will enforce the broadcast property for it). 



9.1.2. Broadcast amplifier frames. The string prog will be called a uniform program (with complexity 
coefficient c ) for the functions fk if there is some constant c such that it computes on the universal 
computing medium Univ the value /fc(r) from /c,r with space- and time-complexities bounded by 
c(logfc-|- |r|) where |r| is the total number of bits in the arguments r. If a sequence {fk) of functions 
has a uniform program it is called uniform. As a special case, a sequence of constants Ck is uniform 
if it is computable with space- and time complexities clogfc from the index k. 

Here, and always from hence, when we refer to a natural number A: as a binary string we always 
mean its standard binary representation. We will distinguish the simpler notion of a "broadcast 
amplifier frame" from the full-featured "amplifier frame" . The simpler notion is sufficient for the 
basic non-ergodicity result. A broadcast amplifier frame Frame is given by sequences 

Capf.,Qk,Uk, 

and the constant i?o satisfying certain conditions. Here, Qk is the number of cells in a colony. For 
an appropriate sequence i^fe > 1, the number of dwell periods in a work period will be allowed to 
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vary between Uk/vk and Uki'k- Given a frame Frame, let us define some additional parameters as 
follows, with some arbitrary positive initial constants T'l > T^i and e = Si,e[ < 

Bk = l[Q^, 

i<k 

i/fe = 1 + RoQk/Uk if T.i ^ T'l, and 1 otherwise. 

k-l 



(9.2) 



k-1 
i=l 

2 



Sk+i = 2b{QkUkVk£k) , 
e'l = AQkUkVkSk, 

k-l 

^'k — s'l + ^ ^'i- 



The formula for fffc+i is natural in view of Lemma B^. The definition of e^^^ takes into account 
the limited ability to simulate a coin-toss with the help of other coin-tosses, e'l is used for the 
error-correction property: it is essentially the probability that there is any fc-level damage at all 
during the work period of a colony of fc-cells. The requirements below defining an amplifier frame 



can be compared to the corresponding conditions for Theorem 7.13 , 

An object Frame given by the above ingredients is a broadcast amplifier frame if the conditions 
listed below hold. 

Complexity Upper Bounds: All parameters in Frame are uniform sequences with complexity 
coefficient Bq; 

Bandwidth Lower Bound: We should be able to deal with numbers comparable to the size of 
the colony and the work period within a cell: 

(9.3) Capk > RologUk-, 

Capacity Lower Bound: The colony must represent the state of the big cell with redundancy: 

(9.4) i?oCapfc+i < QkCap^. 

Work Period Lower Bound: There must be enough dwell periods in a work period to perform 
the necessary computations of a simulation, with some repetitions: 

(9.5) Uk>Ro{\ogQk + \ogk)Qk. 

The factor logQ^ -flogfc is needed only for technical reasons, in the proof of self-organization. 
Error Upper Bound: The following upper bound on QkUk implies, in view of the definition of 
the error probabilities, that these decrease exponentially: 

(9.6) < Ro/QkUk. 
Time Stability: 

(9.7) ^ < 3. 

^»k 



RELIABLE CELLULAR AUTOMATA 



77 





-1, 




= 2, 


Qk 




Uk 


= 


Capk 


= c(logfc V 1) 



This inequality (which is made this strong for simphcity) can be achieved e.g. with Uk > ck^Qk 
for some sufficiently large c. 

The definition of £k+i and ( |9.7| ), implies 

Ek+i = 2h{QkT\+^/T\fel < 25{2QkUkfel < 32sl-y{Rof < el'^ 

hence 

(9.8) £fc<e'-''". 

From here, it is easy to see that e'^ also converges to with similar speed. 

Example 9.1. Let us choose for some c > 1 



(9.9) 



Conditions ( |9.3| ) and ( |9.5| ) are satisfied for large c. 

There is a slightly better bound on Sk here than in ( |9.8| ). 

(9.10) Efc+i < 25{2QkUkfel = 32c'''k'hl. 
Let us prove, by induction, for small enough e, 

(9.11) < e2'=-^+('=+i)/4. 

For fc = 1, the statement gives e < e. For fc > 1, using ( ]9.10D and the inductive assumption, 

For small enough e, the last factor is less than 1. (} 

9.1.3. Amplifier frames. The above definitions must be modified for the notion of an amplifier frame, 
(not needed for the basic non-ergodicity result). An amplifier frame Frame is given by the same 
sequences as a broadcast amplifier frame, and some additional sequences 

qk,Wk, Rd-transk () , Output'' ,Ri{k). 

Here, Rd_transk{) describes a rider transition function with bandwidth < Wk, the field Rider'' 
has subfield Output'' on which this function has monotonic output, and the number Ri{k) is a 
redundancy coefficient. Each cell will have space 

i<k 

for the represented information. An object Frame given by the above ingredients is an amplifier 
frame if the conditions listed below hold. 

Complexity Upper Bounds: The sequence Rd-trans'l!^''\) of functions (see ( |2.5| )) and all 
other parameters in Frame are uniform sequences with complexity coefficient Rq; 

Bandwidth Lower Bound: We should be able to deal with numbers comparable to the size of 
the colony and the work period within the time and space bound Wk- 



(9.12) Wk>logUk; 
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Redundancy Lower Bound: The redundancy must help dupUcate the content of at least a 
certain constant number of cells: 

(9.13) >i?o/Qfe. 

Capacity Lower Bounds: The capacity Cap^. of a cell should accomodate all the original in- 
formation 2_B^ (2 bits per primitive cell). 

(9.14) 2B'^ < Cap,. 

The colony must represent the state of the big cell with redundancy. It also needs extra space 
for computing with a few bandwidths of information of the big cell. 

(9.15) (1 + Ri{k))Capk+i + Rowk+i < QkCap^. 

Work Period Lower Bound: There must be enough dwell periods in a work period to perform 
the necessary computations of a simulation: 

(9.16) Uk>RoQki^^+logQk + logk) 

Wk 

It costs QCapi./wk to perform Q steps of simulated computation of Univ when each track of 
size Wk must be worked separately. The other terms in the parentheses are typically of lower 
order and are needed only for technical reasons, in the proof of self-organization. 
Error upper bound, Time stability: These conditions are the same as the ones for broadcast 
amplifier frames. 

The inequality (D.15) has the following consequences, similar to ( 4.32| ): 



Lemma 9.2. 



E 



Wk+l 



k ^^=+1 



< oo. 



Thus, as expected, if we have constant space redundancy i.e. Capf.^i/ Bk+i is bounded away 
from then X]fc-^i(^) ^ redundancies of each level form a converging series. This will 

then impose, via (9.16), a lower bound on the time redundancy achievable by these amplifiers. 



Proof. Inequality (9.15) can be rearranged as 

(9.17) nm^^ + < ^ - ^ 



□ 



Let us remind the reader that the constructions involving the Rider and Guard fields are not 
needed for the simple non-ergodicity results. 
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Example 9.3. Let us choose for some c > 1 



(9.18) 





= 1, 


T\ 


= 2, 


Qk 




qk 


= Qk, 


Ri{k) 


= c/Qk = l/(cfc2), 


Uk 


= c3fc4 = cQfc/i?i(fc), 


Wk 


^Ri{k)Bk, 


Capfe 


= 2Bfc(l + l/fc). 



The transition function Rd_tranSk will be the aggregated function Tr yj^^^_as defined in Example |3.4| . 
This satisfies the re quired complexity bounds. Conditions ( 9.12 ) and ( 9.13 ) are sat isfied with large 
c. Condition ( |9.14 ) is satisfied by the definition of Cap^.. It is easy to check that (3.15) also holds 
with large c. Condition (9.16) will be satisfied with large c. The error upper bound is satisfied just 

again. 



as for the broadcast amplifier example, and we have (9.11 



The parameters of this example are chosen for constant space redundancy, at the price of time 
redundancy that comes as an extra factor of Capf./wk at every level. 

For later reference, here is an explicit expression falling between T,k and T'k- 



(9.19) 



i<k 



((fc — 1)!)"* = g4fclogfe+0(fc)^ 



The expression for Bk is similar. <^ 

9.2. The existence of amplifiers. We give a separate, simpler track of definitions for broadcast 
amplifiers. A (uniform) broadcast amplifier is given by a broadcast amplifier fram e F rame, uniform 
sequences (Sim^transk) , {Mk, $fe)fc>i with <^k = (^fc, <^fc*) as defined in Subsection ^.6| and a uniform 
sequence of codes i^k** with a sequence of fields {F^) such that 
(a) 

(9.20) Mk = Roh{Trk,Bk,T,k,T\,ek,e'k); 

(b) {F^) forms a broadcast field for both ((^fc*) and {ipk**), as defined in Subsection O 

(c) Simulation $fc has e'j!-error-correction for ^pk**', 

(d) The damage map of the simulation $fc is defined as in Subsection 



8.1 



(The last condition explains the definition of £a;+i-) With Lemma 8.4, it will imply the Restoration 
Property for the simulated trajectory. 

The following definition is not needed for the simple non-ergodicity result: A (uniform) ampli- 
fier is given by an amplifier frame Frame, uniform sequences (Sim.tranSk), {Rider ), {Guard ), a 
hierarchical code 



(9.21) 



* = (Sfe , Qfc , (Pfc, , Rider'' , qk , 



Ik, ak)k>i 



as in 4.13 and an amplifier {Mk, ^k)k>o where $fc — ($^,, fk*) is defined as in Subsection 3.6, and 
a sequence of codes ipk** and fields Guard'' such that 

(a) {Rid er'') , and {Guard'') form a guarded shared field for {(pk*), (Vfc**)j as defined in Subsec- 
tion U; 

(b) the damage map of the simulation $fc is defined as in Subsection 

(c) Trk is combined from Rd_tranSk and Sim_tranSk, with fields Rider , Guard''; 
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(d) Affe is as in ( |9.20| ); 

(e) $fe has e'^'-error-correction for (/?fe**; 

Most of our effort will be devoted to proving the following lemma. 

Lemma 9.4 (Amplifier). Every (broadcast) amplifier frame Frame with large enough Rq can be 
completed to a uniform (broadcast) amplifier. 

9.3. The application of amplifiers. 



Proof of Lemma 3. ''/ . We will actually prove the more general variable-period variant of the lemma, 
with Tk 



Let us define a broadcast amplifier frame Frame e.g. as in Example 9.1. Applying the Amplifier 
Lemma in the broadcast case, we obtain a broadcast amplifier with media A/^ as in (9.2C) with a 
hierarchical code having a broadcast field (F*^) for both ((^3^*) and {ipk**)- Let us denote the whole 



system of code and the broadcast field (with the additional mappings 7fc) by 4" as in ( 4.11 ). For any 
value ui of the field F^, we create an initial configuration ri^{-, 0) — T{ui] ^) as in (4.12). 

Then all properties but the initial stability property of an abstract amplifier (defined in 
Lemma 3.7) are satisfied by definition. For the initial stability property it is sufhcient to note 
that each medium is a robust medium, with work period lower bound T.^.. Therefore, if rf' 
is a trajectory of and t < T,^ then for each x the probability that rj''{x,t) ^ ri''{x,0) is less 
than the probability that damage occurs in {x} x [0,T,j.]. This can be bounded by the Restoration 
Property. □ 



Proof of Theorem 6.14 



1. This proof starts analogously to the proof of Lemma B.7 



Let us define an amplifier frame Frame as in Example 9^, with the rider transition function 
having the property 

Rd.transk{r) — ro, 

i.e leaving all fields (in particular, the rider and guard fields) unchanged. Applying the Amplifier 
Lemma, we obtain a uniform amplifier with media Mk as in ( |9.2(]| ) with a hierarchical code having 
a guarded shared field Rider with guard field Guard as in ( |9.2l| ). Let K be largest with Bk < N. 
For any infinite configuration g e {Si.Rider^)^, we create an initial configuration r]^{-,0) = T{g) 
with a guarded shared field (F'"', Guard''), with active level K. As mentioned after ( 4.30| ), if N is 



finite then r{g) = r{g;K). Let the trajectories ry^, rj'' be defined as before. Then we will have 



Tj''{x,0). Guard'' = -1 for all x, for fc < if and tj^ {x,0).Guard^ = 0. 

Let xi — X{y) for a y in Visible{K{N), N) (since the code was chosen such that 
\Visible{K{N), N)\ — N, this is not really a restriction on y), and let ti > 0. Remember the defi- 
nition of the aggregated input configurations g'' in (4.21) and of X{y,i; k) in (4.19). Eventually, 
we want to estimate the probability of g{y) = ri^{X{y), ti). Rider but we will use a generalization 
corresponding to 



261). Let 



Xk = Ok + X{y, k), x'^ = o'^+ X'iy, k) 
1 < k < K, then (4.20) shows that Xk+i is a cell of ri''~^^{-, 0) whose body contains Xk with address 



y'f. where y^. was defined before (4.19). Let Tk be the event that 
(9.22) T]''{xk,t). Rider'' = g''{x'k) 

T,k/'^, ti]- 



and r]''{xk,t). Guard = 
2. We have Prob{ J^k+i n 



-1 holds for t in [ti 
■^^k } < si 
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Proof. Assume J-'k+i- Then, according to the error-correction property, except for an event of 
probabihty < e'jj, for every t in [ti — r,j,/3,ti] there is a t' in [ti — T,j^^i/3,t\ with 

T^'^ixk, t).Rider'' h ipk**iv''^Hxk+i,t'my'k).Rider\ 

Since (Rider^) is a guarded shared field and r]''^^{xk+i,t).Guard'''^^ — —1 for all t in ti + 
[— T,fc_^]^/3, 0], we can replace (pk** with (pk* in the above equation. J-'k+i implies 

rj'^+\xk+ut').Rider''+' = g'^' {^'k+i) ■ 



The identification property and the aggregation property (4.22) implies 

ipUv'+\^k+i,my',).Ridcr'' = g'^ix',). 

Thus, except for an event of probability < e'^, we have J-k- 
Let 

n = /\{k -.ti < or fc = }, 

t2 — ti — T,„/3. 



3. Consider the case t2 < 0. 



]2|) holds for k = n, t = 0. Since the 
, t) undergoes any change during this 



By the construction and Proposition 4.7 the relation 
duration of [0,ti] is less than r,„ the probability that 77" (x„ 
interval is less than e„, proving Prob{ -^J-n } < £«• 
, Consider the case t2 > 0: then n = K < 00 and the space is finite. 

According to the definition of in (4.27), and of r/^(-, 0) = 0) in the same proof, the value 
ri^{oK + vBk mod N, 0) is defined for y in [0, \_N/Bk\ — !]• Thus, at time the space is filled as 
much as possible with adjacent cells of -q^ one of which is xk- The set R = "Ln x [0,^2] can be 
covered by at most 

T^\N/BK^\2t2|T,K^■ 

copies of the rectangle Vk- Since -q^ is a trajectory of the robust medium M/f , the probability 
of damage on each of these rectangles is at most Ek- Therefore the probability that there is any 
damage in 77^ over R is at most tek- Assume that there is no damage in r]^ over R. Then 
the cling-to- life condition and the computation condition imply that all cells of r]^{-,Q) remain 
nonvacant until time ^2- We defined RdJ^r&nsKi) to leave the rider and guard fields unchanged, 
and r]^ {x,Q).Guard^ — 0. Hence, by the definition of "combined" in Subsection 9.1, for each of 
these cells z we have 

rj^{z,t2).Rider^ = if {z,0).Rider^ 
- = K. Hence the total probability upper bound is 



implying (9.22) for k 



fe=i 



where we used N < Bk+i 
for small enough e, and < B^+i 



for some constant C2. 



BkQk- With the parameters of Example 9.3, we have £kQk < £^ 

^2K\ogK+o{K)^ hence EkQk can be written as e''^^^) with 



h2iN) = Arc2/iogiog7v 



□ 
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Proof of Theorem 6.11: We proceed in analogy with the proof of Theorem 3.14 using the same 
notation wherever appropriate and pointing out only what is new or different. We will find out by 
the end what choice for the functions ho{t) and hi{t) works. One can assume that Tr is commutative 



in the sense of Subsection 7.1 since the methods of that subsection can be used to translate the 



result for arbitrary transition functions. The transition function Rd-tranSk() -Rider on the rider 



track will be defined to be the aggregated transition function Tr^'' , as in Example |3.4| , and the 
Rd_tranSk{)-Guard'^ will lea ve the guard field unchanged. With this, an amplifier frame will be 



obtained as in Example 9.3, The Amplifier Lemma gives a uniform amplifier with media Alk as 



in ( 9. 20] ) with a hierarchical code having a guarded shared field {Rider'^), {Guard'') as in (9.21) 



The parameters of that example are chosen in such a way that B'^. = B^. and therefore 
(9.23) \Visible{K,N)\ = N. 



Let C be any trajectory of Tr over Z with C(':0) of the form Init{g) (see (5.4)), satisfying 
2\Supp{(^ , t)\ < s, where g G {Ei.Ridcr^ .Input)^ . For any s and ti > 0, let a sufficiently large 
K be chosen: we will see by the end of the proof how large it must be. We certainly need 

(9.24) SuppiCt) c Visible{K,N) 



which, in view of ( 9.23 ), can be satisfied if > s/2. Let 77^(-,0) — r{Init{g); K). The trajectories 
rj^, rf' are defined as before. Let x\ = X{y) for a ?/ in Visible{K, N). Let C'^(x,t) be the trajectory 
of the aggregated cellular automaton CA{Tr^^ ,B'f.,B'^) with the aggregated initial configuration 
tB^,(C(-,0)). Let 

(9.25) u^\[t2lT'K\-l\+. 
Let Tk be the event that 

(9.26) rj'' {xk,t).Rider'' .Output h {x'k,u). Output 



and r]''{xk,t). Guard'' — —1 holds for t in [ti — r,j./3,ti]. 

1. We have Prob{ Tk+i n -^Tk } < e'^. 

Proof. Assume J-k+i. According to the error-correction property, except for an event of proba- 
bility < e'l, for every t in [^i — T,k/3,ti] there is a t' in [^i — r,^,_|_i/3, i] with 

r^''{xk,t). Rider" h ^ k**{^''+\xk+i,t')){y'k). Rider" . 

Again, we can replace (/^fc,, with Lpk^, here. Tk+i implies 

r]''+^{xk+i,t').Rider''+^. Output h C'^^^ (4+i, «)• Output. 

Hence, the identification property and the aggregation property ( 4.22| ) implies 



'^k*iv''^^ixk+i,t')){y'k).Rider''. Output h C''ix'k,u). Output. 
Thus, except for an event of probability < ej!, we have 

ri'' {xk,t).Rider'' .Output ^ {x'^,u). Output. 

2. Consider the case ^2 < 0. 

Then the statement of the theorem holds since the ({■, 0). Output is filled with *'s. 

3. Consider the case t2 > 0: then n = K < oo. 
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According to the definition of ?7^(-, 0) = 0), at time 0, the space is filled as much as possible 
with adjacent cells of r]^ one of which is xk- Let us define the intervals 

/ = lj{ Ck{y) + [0, Bk - I] ■■ C'^iy) is in any support of }, 
J = / n .Ti + BK{t2/T,K + !)[-!, 1]. 
The set i? = J x [0, 12] can be covered by at most 

r = \\J\/Bk] \2t2/T,K-\ < 2{t2/T,K + 2) \2t2/T,K] ■ 

copies of the rectangle Vk- Since 77^ is a trajectory of the robust medium MK^ the probability 
of damage on each of these rectangles is at most ek- Therefore the probability that there is any 
damage in 77^ over R is at most rex- Assume that there is no damage in 77^ over R. Then 
the cling-to-life condition and the computation condition imply that all cells of 77^ (-,0) remain 
nonvacant until time ^2- 

We defined Rd.tranSkQ to leave Guard'' unchanged, and to be the aggregated transition func- 
tion Tr^'', on Rider'' . We have r]^ {x,0).Guard^ — 0. Hence, by the definition of "combined" 



in Subsection 9.1, each cell in each of its transitions during the damage- free computation, applies 



Tr^^ to the field Rider^ , making at least [t2/T*K\ — 1 steps. By ( |9.24 ), / contains the image 



of a support of , therefore the confinement to this interval does not change the result. Thus 

r]^ {x,t2).Rider'^ .Output h {x,u). Output, 
and thus J-'k holds. This gives a probability upper bound 

oc 

+ 2eK{t2/T.K + 2)\2t2/T,K^. 

k=l 



Using ( 9.25 ) and (9/7) (and ignoring integer parts and the additive 1 and 2) this can be upper- 
bounded by 

00 
fe=i 



For small enough e, by ( ^.11 ), the second term is < u^e"^ , and hence 

K > ho{u) = log log M 0(1) 
will do. This gives T*k = e''^'°s^+c(^) < (iogu)5i°gi°gi°g" for large enough u showing that 

h > hi{u) = u(logu)5'°siogiog« 

will do. 

□ 

Remark 9.5 (Non-ergodicity without the error-correction property). The paper mentions no ex- 



plicit error-correction property in the proof of Theorem 2.5. Rather, it uses the fact that the fields 
Age and Addr have a kind of automatic error-correction property. In this way, we indeed arrive 
at several different invariant measures. However, one of these two measures may just be a shifted 



version of the other one, allowing the medium to be "forgetful" in the sense of Subsection 5.1. 
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10. Self-organization 
10.1. Self-organizing amplifiers. Consider a robust medium 

(10.1) M = Rob{Tr^'"\B,T„T',e,e',r). 

In what follows, it is convenient to speak of a distinguished value of the field Color: let it be called 
"blue" . But the definitions and statements hold for every value of Color. 

When a cell turns from a vacant to a non-vacant state, we can distinguish two ways in which this 
can happen. The first one is if the cell has a time with no non-vacant neighbor within T,/2 before 
the event; let us call this spontaneous birth; the second one is when it does not: let us call this 
creation. Under certain conditions, spontaneous births are to be prevented, since they may give rise 
to the wrong kind of cell. One way to achieve this is to populate an area tightly with cells. 

For an interval / not consisting of a point, let 

(10.2) r{I,d) = {x : [x -d,x + d]nl 

We will also write T[x,d) — [x ~ d, x + d—]. In a space-time configuration 77 of M, a point {x,t) 
is said to be controlled if r{x,B) x [t ~ 6T*,t] contains a cell. It is blue if all cells in this area are 
blue. A space-time set is controlled or blue if all of its points are. A medium has the lasting control 
property if for all of its trajectories, for all sites xq and times ti < t2, with 

/ = r(xo,((t2-ii)r7r. + 4)B), 

if J X {ti} is blue and I x [ti ~ 2T', t2] is damage-free then {xq, t2) is blue. Let Mi, M2 be robust 
media, and $ a simulation between them. We say that this simulation has control delegation property 
if the following holds for all sites Xq and time ti with 

I = rixo,{8T'2/T,2+A)QB). 

Let 77 be a trajectory of Mi. If I x {ti — 8T'2} is blue in ^*{ri) and the area I x [ti — 16T'2,ti] is 
damage-free in 77 then (a;o,ti) is blue in rj. 

For some parameters D,a > 0, we will say that the trajectory (yit, rj) of medium M is {D, a)-blue 
at time t if the following holds. Let Ii, . . . , /n be any system of intervals of size D such that T{Ij , D) 
are disjoint. Then the probability tha t eac h of the sets Ij x [t — 3T',t] is non-blue is at most cr". 



Let us be given an amplifier as in ( 9.21 ), constants Ci, C2, ki, and a sequence > with 

CTkUk 0, 

) 3) 

^Tfc+i > RoiUkSk + QWk + e-^^'^^'Q''). 



We will call this a self- organizing amplifier if the following properties hold: 

(a) all media in it have the lasting control property; 

(b) each of its simulations has the control delegation property; 

(c) for every fc, every time t, every trajectory (/i,/;) of that is (CiBfc, (Tfc)-blue at time t, the 
trajectory {ij.,^l{r])) of Mfc+i is (CiBfc+i, o-fc+i)-blue at time t + C2T'k+i; 



Example 10.1. To the choices of Example 9^ , let us add the choice ak — c Let us check ( 10.3 ) 

Its right-hand side is 



Given the bound e^-^ on et for > 1, (see (|9.8D), this is less then c if c is large enough and 

£ is small enough. <0 
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Lemma 10.2 (Self-Organization). Each amplifier frame Frame with large enough Rq and 

small enough e can be completed to a self-organizing amplifier with the property that if 
Rd_trans{r). Guard = — 1 for all r then Sim_trans{r). Guard — —1 for all r. 

10.2. Application of self-organizing amplifiers. 



Proof of Theorem 6.11 . 1. This proof starts analogously to the proof of Theorem 6.14 . 

Let us define an amplifier frame Frame as in Example 9^, with the rider transition function 
having the property 

Rd.tranSkir). Rider — ro. Rider, 

Rd-transk{r). Guard = — 1. 

i.e. leaving the Rider field unchanged and always setting the guard field to —1. It will have a 
broadcast field Color with m bits. Applying Lemma 10.2, we obtain a self-organizing amplifier 



with media Mk as in ( 9.20 ) with a hierarchical code having a guarded shared field Rider , Guard 



Also, we will have Sim_trans{r). Guard = —1 for all r. Let 

K = sup{A: : 5Bk <N}. 

For a certain value of the field Color that we call Blue, we create an initial configuration r/^(-,0) 
consisting of all blue latent cells covering the whole space, with Guard — —1. Let the trajectories 
r]^, T]^ be defined as before. Let {xi,ti) be a space-time point. Eventually, we want to estimate 
the probability of Blue — ri^{xi, ti).Color^ . 

Let 

h = {xo}, 
Vl = ti, 
Vk+i = Vk - ST'fe+i, 

4+1 = r(4., (8r%+i/T.fe+i + 4)5^+0, 

Ml = 0, 

Uk+1 = Uk + C2T'k+i. 

li k < K then by the self-organizing property of the amplifier, the evolution r/'' is [CiBk, (Tfc)-blue 
at time Uk for each k. Let J^k be the event that Ik x {w/t} is blue in rf . 

2. There is a constant ki such that 

Prob{ J^fc+i n -^Tk } < KiUkQk£k- 

Proof . The right-hand side is an upper bound on the probability of damage occurring in rj'^ in 
Ik+i X [vk+i,Vk]- If damage does not occur there and J^k+i holds then the control delegation 
property implies J-k. 
Let 

l\{k: Vk+i < Uk+i oi k^ K}, 

t2 = Un. 

3. Consider the case n < K. 

Then we have u„ < u„ < + (C2 + ^)T*k+i- Let 

J = r(/„, ((C2 + 8)r*„+i/r.„ + 3)s„). 

Let B be the event that J x {u„} is blue in ry". Since 77" is (Cii?„, (T„)-blue at time w„ for each 
n, the probability of -iB is at most K.2Un<yn for some constant ^2. 
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3.1. There is a constant K3 such that 

Prob{ B n -^Tn } < l^?,UnQn£n- 

Proof . The right-hand side is an upper bound on the probabihty of damage occurring in rf^ 
in J„ X [M„,ti„]. If damage does not occur there and B holds then the lasting control property 
implies J-n- 

Thus, the probability that (xi, ti) is not blue can be upper-bounded by 

n-l 

n 

i=l 

4. Consider the case n = K: then the space is finite. 

Let B be the event that the whole space Zjv is blue in 77". Since 77" is (Cii3„, (T,i)-blue at time u„ 
for each n, and 5i?„+i < N ^ the probability of -iB is at most K2Qn<^n for some constant K2- 

4.1. There is a constant K3 such that 

Prob{ B n -^Tn } < H3UnQntl/T,,,en. 

Proof . The right-hand side is an upper bound on the probability of damage occurring in rj'' in 
Jn X [un, Vn]- If damage does not occur there and B holds then by the lasting control property, 
J-n also holds. 

Thus, the probability that (zi, ti) is not blue can be upper-bounded by 

i=l 

Thus, the probability that {xi, ti) is not blue can be upper-bounded in both cases ||and ^ (ignoring 
multiplicative constants) by 

n-l 

C/„Cr„ -I- UkQkSk + UnQntl/T.^En ^ Ai + A2 + A3. 

1=1 

With the parameters of Example |10.1 we have A2 = 0{e) and A3 can be written, as in the proof of 
Theorem |1| as e''^^^) with 

h2iN) ^iV=2/loglogW^ 

for some constant C2- Also, Ai < e^''*" for some K4 > 0, li n = K then this gives Ai < e^'^^^ . 
Since N = e^^^°sK+o{K) ^-j^^g 

Ai = 0{N-°-^). 

En<K then = 0(ELi ^'fe)- Hence = e2»i°g"+o(i), giving 

Ai = 0(tr°')- 

Both of these bounds are poor if ti is small. But in this case there is a trivial upper bound O(iie) 
on the total probability that (2:1,^1) is not blue: this bounds the probability that there has been 
any damage since the beginning that could influence {xi,ti). □ 
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11. General plan of the program 

Our eventual goal is to prove Lemma |9.4| (Amplifier). The medium Mk+i to be simulated has 
reach 1. The simulation is the composition of two simulations. First, we simulate Mk+i by a medium 
having reach 2, and then we simulate by a medium Mk having the same cells but with reach 



1. The latter simulation uses the construction in Theorem 8.10. The bulk of the work is in the first 
simulation, to which now we proceed. From now on, we fix the level k of the simulation hierarchy, 
refer to medium Mj, as M and to medium Mk+i as M* . The subscript will be deleted from all 
parameters in M and a superscript * will be added to all parameters of M*. We will refer to cells 
of M as small cells, or simply cells, and to cells of M* as big cells. 

According to the definition of an amplifier frame, the medium M has two special fields called 
Rider and Guard, and can also have other fields. These two fields will be subfields of the field Info 
in our medium. In what follows we define the function Sim_trans only: the function Rd_trans is given 



in advance. The updating of Rider and Guard is determined by the rule (9-1). Thus, Sim_trans 
determines the next value of Rider only in case of Guard ^ 0, and the next value of Guard in case 
of Guard < 0. We will not point out always that what we define is only Sim^trans but this is to be 
understood. In particular, when our program requires to update Info then in the subfield Rider this 
will be actually carried out only if Guard ^ 0. 



For simplicity, the proof will only be given for qk = Qk (see 4.1.4). The general case is not more 
difficult, but would need extra notation. The track Info of a colony contains the string that is the 
state of the represented cell, encoded via an error-correcting code. Info consists of two subfields. 
Info. Main and Info.Redun. The track Info. Main contains the intended original information, including 
Rider and Guard. The track Info.Redun contains "parity checks" for the string on track Info. Main, 
but contains only in cell 1 (due to the "controlling" property). Since Rider must control cell 1 of 
the colony in the sense of 4.1.3| , the track Redun does not use cell 1. For the error-correcting code. 



the field Info. Main will be subdivided into "packets" such that the parity check bits for each packet 
will be computed separately. The fields Rider* and Guard* will occupy packets disjoint from the 
other fields and therefore the error-correction for the other fields will proceed without difficulty even 
if e.g. the error-correction for Rider* is prevented by Guard* = 0. 

The program will be described in a semi-formal way; the present section overviews it. Later 
sections restrict the program more and more by giving some rules and conditions, and prove lemmas 
along the way. The typical condition would say that certain fields can only be changed by certain 



rules. The language for describing the rules is an extension of the one given in Subsection [7. 4 We 
will introduce a fair number of fields but they are all relatively small. 



All fields but Info. Main are contained in Buf (see 2.2.1 ) and hence are visible by neighbor cells 
and changeable immediately even by the separable transition function. Cells have an address field 
which determines the only colony (Q-colony) to which the cell belongs. A colony C{y) has base y. 
All properties and relations defined for colonies are automatically defined for the sites of potential 
big cells at their bases and vice versa. The Age field of a cell, called its age, can have values in 
[0, U — 1], but the upper bound will typically not be reached. 

11.1. Damage rectangles. The damage map of the simulation (p = ipi^ was defined in Subsec- 
tion ^.1| . Let us write 

Damage — Damage (77), Damage* = Damage{rf). 



Lemma 8.4 proves the Restoration Property for rj* — f*{f'l) whenever 77 is a trajectory of M. In 
the Computation Property applied to a cell a; of 77*, we are given a rational number a satisfying 
fi{x,a,r]) = 1, i.e. throughout the rest of the construction, it is assumed that Damage* does not 
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intersect the set 

W^{x,a) = [x~2QB,x + 3QB~] x [a ^ T,* /2+, a + 2T'*] 

(where we used the fact that the reach of 77* is 1). Recall sonie definitions from Subsection 
We had V ^ [~B/4,B/A-] x [-r./4+,0]. We have r]*{x,t) G Bad iff Damage{ri) contains at least 
two points u,v such that u + 2V , v + 2V are disjoint and even u + SV,v + 3V are contained in 
(x, t) + W* + {B*/2, 0) (sec Figure |l|). Let 

(11.1) = {u : u + 3y e + (QB/2,0)}. 



The later definitions (13.4), ( 13.10| ) imply Q > 10, T,* > 6T,. This guarantees that the space 



projection of V is at least 1.5QB and the time projection is at least r,*/2. We have ri*{x,t) ^ Bad 
iff there is a rectangle of the form (?/, u) + 2V covering Damage n (x, t) + V' . It is easy to see that 
if Damage* does not intersect then there is a finite set of small rectangles of this latter form 
covering Damage fl Wj* such that each rectangle of the form (z, w) + V intersects at most one of 
them. We will call these small rectangles the damage rectangles. 

Let us say that the damage affects (a;,to) directly unless one of the following occurs: 

(1) {x} X [to — 2T' — r,/2+, to] is disjoint from the damage rectangle; 

(2) There is a dwell period [ii, t2] of x with to - ST* < t2 < to such that {x] x [ti - T,/2+, to] is 
disjoint from the damage rectangle; 

(3) There is a switching time t2 of x with to - 2T' < t2 < to such that {x} x [t2 - T* - T,/2+, to] 
is disjoint from the damage rectangle; 

This definition encompasses the cases when damage prevents the application of parts (|^), (^), (|) 
or (^ of the Computation Property for concluding about (x,to). 

We will say that (x,to) is affected via neighbors if in the above definition, one of the conditions 
does not hold with the interval [x — 2B, x + 3-B— ] in place of {x}. This encompasses the cases when 
damage prevents the application of parts (|) , (0) or (^ (g) of the Computation Property. Thus 
the damage can affect more cells via neighbors but only during the same time interval. 

By the definition of V, the (half-open) damage rectangle has width B and duration T,/2; therefore 
it can affect at most one cell directly, for less than 2T' +r,/2 time units. For each damage rectangle 
we define a corresponding extended damage rectangle, an upper bound on the set of points affected 
directly. Generally, such a rectangle will be denoted by the Cartesian product 

(11.2) [ao,ai~] X [uo+,ui-] 
with ai — flo = B, ui — uq = 2T* + T,. 

Lemma 11.1. Suppose that {x,u) is not affected directly by damage but is affected via neighbors. 
Then x is not affected directly during [u — 5T' , u] and not affected even via neighbors during [u — 
5T',u — 3T']. // (x, u) is the end of a dwell period then x is not affected via neighbors during 
[u-5T\u-2T']. 

Proof. Some damage occurs near x but not in x at some time ti during [u — 2T* — T,/2+, u]. When 
(x,m) is the end of a dwell period then, since x is affected via neighbors, ti > u — T* — T,/2 Then 
it does not affect x directly at all. Clearly, no other damage rectangle affects directly x during 
[11 — 5T*, u], and this damage rectangle does not affect x before ti — T,/2. □ 

11.2. Timing. Let 

(11.3) A = T'/T,. 
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Different actions of the program will be carried out with different speeds. We introduce some delay 
constants: 

(11.4) 4A < po < Pi < P2 

whose value will be defined later. The slowdown uses a subdivision of the state into the following 
fields: 

Wait, Cur, Fut. 

Here, Wait takes values 1, . . . ,p2, and Cur, Fut have equal size. We will work almost exclusively 
with the field Cur, which represents the "current state" . Its subfields Cur.F, will simply be written 
as F. An assignment will typically have the form 

F :=p V. 

This means that F.Fut gets value v and Wait gets value p. The default change of Wait is to decrease 
it by 1 if it is different from 1. When Wait — 1 then Fut will be copied into Cur. Missing subscript 
in the assignment means p = 1. Thus, all examined fields are understood to be the corresponding 
subfields of Cur and all changed fields are understood to be the corresponding subfields of Fut. 

Condition 11.2 (Waiting). 

(a) A field Cur.F can only be changed when Wait.F = 1, in which case it is set to Fut.F. 

(b) A field Wait.F can only either decrease by 1 until it reaches or stay 0, or be set to one of the 
values 1 , . . . ,p2. 

(c) If Fut.F changes then Wait.F will be set to one of the values 1, . . . ,p2. 



E.g., the proper reading of rule 
cond { 

? Addr ^ and Addr^ = 

! Kind :=p Latent 

} 

is that if Cur.Addr{x) = and Cur.Addr{x + B) = then Fut.Kind{x.) must be set to Latent and 

Wait.Kind{x.) must be set to p. 

We will have CurClInbuf = where Inbuf was defined in 2.2.1. Thus, Cur will never be changed 
in one step in response to a change in a neighbor cell. 

11.3. Cell kinds. The values of the address field Addr will vary in [—Q, 2Q — 1]. The cells whose 
colony is their originating colony, will be called inner cells, the other ones will be called outer cells. 
Cells will be of a few different kinds, distinguished by the field 

Kind 

with possible values (for j e {—1,1}) Vac, Latent, Channelj, Growthj, Member, Germ. The kind 
of a non-germ, non-latent cell is determined by Age and Addr. The kinds of cells are ordered by 
strength, as follows: 

Vac < Latent < Germ < Channelj < Growthj < Member. 

Stronger cells prevail over weaker ones in conflicts about who should create a neighbor. The relation 
Kind{x, t) = Vac means that there is no cell at site x at time t. 

Condition 11.3 (Latent Cells). 

(a) A vacant cell can only turn into a latent one. 

(b) A latent cell has Guard = — 1 (else it is bad). 
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A cell will be called dead if it is vacant or latent, and live otherwise. Killing a cell x means turning 
it latent: only when place must be vacated for a new nonadjacent cell whose body intersects the 



body of X will it be turned vacant. This will imply Condition B.7 (Cling-to-Life). Members have 
addresses in [0, Q — 1]. They are the strongest kind in order to maintain the integrity of colonies. 

A right outer cell has (by definition) addresses > Q, and a left outer cell has addresses < 0. Germ 
cells have addresses in [—Q, 2Q — 1]. 

11.4. Refreshing. Since several damage rectangles can occur during a work period, a rule called 
Refresh will be performed several times during the work period and will attempt to correct infor- 
mation by global decoding-encoding. The last step of each invocation of Refresh will be called a 
refreshing step. The number of steps between invokations of Refresh will be 

(11.5) (refresh_time) = T,* /T' 

making sure that at most one damage rectangle may occur between refreshings. 

Certain fields, called locally maintained fields, will be kept constant over the colony, for most of 
the work period. Here are some of these. The field Doomed is 1 if the represented big cell must be 
removed (the site to become vacant), and otherwise. Cells with Doomed = 1 are called doomed. 
Cells will have fields 

Creating,, Growing, (j e {-1, 1}) 



with values or 1. Creating j = 1 will mean that a cell is a creator in the sense of Condition 8.6 
(Creation). Creating j of a big cell will be broadcast into Growing j of its small cells. Growing j — 1 
signifies the collective decision of the colony to grow a new neighbor colony in direction j. The track 

Controloij 

controls the retrieval of information from neighbor colonies. It will be updated periodically by the 
subrule Retrieve. 

The globally maintained field 

End 

will be locally updated by the rule 

Find^end. 

The work period will end when Age — End. This will help setting the absolute duration of the work 
period more predictably. The default value of End is C/ — 1 and we will always have End > Age. 

Each locally maintained field F has some update ages: ages when F will be recomputed. An 
interval of size AQ around each of these ages will be called an update interval for F. 

11.5. A colony work period. The main stages are as follows: 
Idle; 
Extend ; 
Idle; 

Retrieve ; 

Compute; 

Idle; 

Find-end ; 
Grow; 
Shrink ; 
Finish ; 
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The numbers 

(conipute_start) < (idlc_start) < (grow_start) < (grow_end) < End 

define the ages starting or ending important stages of this program and wiU be defined later. The 
idle stages make sure that 

• the computation and communication times are positioned correctly; 

• faults in one part have limited effect on other parts. 

The numbers up to (idle_start) (i.e. (compute_start) and (idle_start)) are constant, and the numbers 
End— (grow -Start) and End— (grow.end) are also constant. The difference (grow_start) — (idle_start) 
is not constant since End will be a computed value. Here is a short description of the major stages. 

Extension: The subrule Extend tries to extend some arms of the colony left and right, to use 
in communicating with a possible non- adjacent neighbor colony. In direction j, if it is not 
adjacent to another colony it will extend an arm of cells of kind Channelj. In channel cells in 
the positive direction, the Addr field continues its values through 

Q,g+1,... ,2Q-1. 

Similarly in channel cells in the negative direction. Channel cells are weaker than member 
cells, so the growth of a channel does not normally damage another colony. The channels will 
be killed at the end of the computation. 
Retrieval: Retrieval starts at age (compute_start) . During this, the colony tries to retrieve the 
state represented by it and its neighbor colonies. The neighbor colonies will cooperate in this, 
having a dedicated mail track and control track for serving each neighbor. Atomicity will be 
guaranteed by waiting, similarly to the proof of Theorem 8.10| (Reach Extension) . 



Computation : The subrule Compute computes the output of the simulated transition function 
and stores it on the track Hold. It will put Doomed = 1 into each cell of the colony if the 
represented cell is to be erased. We will always set Doomed = at Age — (compute_start). 

Idling: After the computation, some idling puts sufficient time between it and the end of the 
work period to make sure that the retrieval falls into the observation part of the work period. 
Procedure Find_end computes End. 

Growth: If Growing ^ — 1 then, between values (grow_start) and (grow_end), the colony tries to 
extend an arm of length at most Q in direction j. These cells are of kind Growthj, or growth 
cells. 

Birth: A latent cell x turns immediately into a germ cell with address —Q and age 0. The germ 
thus started begins to grow to the right, trying to fill 3 colonies until age (germ_grow_end). At 

Age — (germ_cnd) — 1, 

germ cells turn into member cells. 

Germs will implement the self-organizing property of the medium. 

Shrinking: When they reach the end of their growth period, growth and germ cells stop produc- 
ing offshoot. In what follows, all edges whose existence is not justified by these processes (these 
edges will be called "exposed") will be subject to the process Decay. Therefore normally, a 
growth either disappears before the end of the work period or it covers a whole new colony by 
that time. Similarly, germ cells are required to cover 3 neighbor colonies. 

Finish: Rule Finish will be called when Age = End. It reduces the addresses of growth cells 
mod Q. Outer cells and inner germ cells turn into members. If the colony is doomed it 
will be erased, (starting from the right end, since a doomed right endcell will be considered 
inconsistent with its neighbors). Otherwise, the information from the Hold track will be copied 
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into the corresponding locations on the Info track. This final rule will take only a single step, 
and will also be called the cut. 

11.6. Local consistency. The basic structural soundness of a colony will be expressed by some 
local consistency conditions. Two cells locally consistent with each other will be called siblings. For 
J e {-1,1}, let 

Sib{j) = 1 

if cell X is a sibling of its adjacent neighbor in direction j, and Sib{j) = otherwise. Sometimes, 
rules refer to the relation Sib{) but can be understood without going into the details of the definition 
of this relation itself. 

11.6.1. Siblings. Two aligned cells x and x + iB will be called space- consistent if Addr{x + iB) — 
Addi{x) = i (mod Q). 

Let us consider cells x and y = x + jB for j = —1, 1, with \Addr{x)\ < \Addr{y)\. These two cells 
belong to the same work period if 

< Age{x) - Age{y) < 1. 

They straddle a work period boundary if Age{x) = 0, Age{y) = End. They are siblings if one of the 
following two properties holds. 

(1) x,y belong to the same work period, originate at the same colony, either are both germ cells or 
neither of them is, and x is not a doomed right endcell with < Age < (compute_start). 

(2) X, y belong to the same colony, and straddle a work period boundary. 

Of course, siblings are space-consistent. An interval of cells in which the neighboring cells are siblings 
will be called a domain. A domain of size n will also be called a n-support of its members. Let us 
call two cells relatives if they can be embedded into a common domain but are not necessarily in 
one. A colony with starting cell x will be called full if it is covered by a domain in such a way that 
Addr{x + iB) = i (mod Q). 

11.6.2. Age updating. The rule for updating age is similar to the "marching soldiers" rule for up- 



dating Age in Subsection 7.1. As seen from the definition of siblings above, we impose some extra 
order on age: the age of all cells in the same extended colony must be non-increasing as they become 
more distant from the cell with Addr — 0. Also, there will be a bit 

Frozen 

with the property that when Frozen = 1 then Age will not be changed. Here is the basic updating 
rule for age: 

rule March { 
cond { 

? Frozen = and Age < End and Vj S {—1, 1} is dead or the 

the increase of Age does not break the sibling relation with dj (x) 
! Age :=p, Age + 1 

}} 

Condition 11.4 (Address and Age). 

(a) Only Finish can change Addr of a live cell; 

(b) Only March and Finish change Age of a live cell; 
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11.6.3. Repairs. The rule Purge eliminates isolated cells. The rule Heal repairs a small hole. An 
unrepaired hole will be enlarged by the rule Decay: this will eventually eliminate partial colonies. 

The damage rectangle can destroy the information represented on its space projection, therefore 
the information representation on the Jnfo track will be a redundant, error-correcting code. Infor- 
mation will be decoded before computation and encoded after it. The damage can also disrupt the 
computation itself therefore the decoding-computation-encoding sequence will be repeated several 
times. The result will be temporarily stored on the track Hold. 

11.6.4. New colonies. The following condition helps enforce that newly created cells in the simulation 
are latent. 

Condition 11.5 (Outer Info). Suppose that (p*{ri){x, t) ^ Bad, and the colony with base x at time t 
is full and is covered with member cells belonging to the same work period, and is not intersected 
by the extended damage rectangle. Then ip*{ri){x,t) depends only on the Info track of this colony 
via some decoding function a* . If the colony is covered with germ or outer cells then the state that 
would be decoded from this track is latent. <0 

11.7. Plan of the rest of the proof. In order to preserve intelligibility and modularity, rules 
and conditions belonging to the program will only be introduced as they are needed to prove some 
property. 



The crucial Lemma 14.7 (Attribution) says that soon after the disappearence of the big damage 
Damage*, all live non-germ cells not immediately arising from Damage can be attributed (via a 
path of ancestors) to some nearby colonies (all disjoint from each other). Thus, this lemma enables 
us to reason about the process over this area in terms of big cells. The lemma is important e.g. in 
seeing that a big cell can grow a neighbor if no other cell is nearby. In terms of colonies, this means 
that if no other colony is nearby, then a colony can grow and create a neighbor colony. This is 
not obvious since there could be "debris" : earlier damage could have left bits and pieces of larger 
colonies which are hard to override locally. The Attribution Lemma will guarantee that those bits 
and pieces are not there anymore at the time when they could be an obstacle, since whatever is 
there is attributable to a big cell. 

The Attribution Lemma also plays a role in healing. Most local healing is performed by the rule 
Heal. However, if the damage occurs at the end of some colony C then it is possible in principle that 
foreign material introduced by damage is connected to something large outside. The Attribution 
Lemma will imply that the foreign matter is weaker and can therefore be swept away by the regrowth 
of the member cells of C. 

The idea of the proof of the Attribution Lemma is the following. Suppose that (a;o,io) is a cell 
whose origin we want to trace. We will be able to follow a steep path {xi , tj) o f "ancestors" backward 



in time until time t„ ~ to ~ mQ with some large coefficient m. Lemma 13. 3| (Ancestor) shows that 



it is possible to lead a path around a damage rectangle. The attribution consists of showing that 
{xn,tn) belongs to a domain covering a whole colony. To prove this, we will show that the rule 
Decay, which eliminates partial colonies, would eventually cut through the steep path unless the 
latter ends in such a domain. In actual order, the proof proceeds as follows: 

• Some of the simpler killing and creating rules and conditions will be introduced, and some 
lemmas will be proved that support the reasoning about paths and domains. 



We prove the Ancestor Lemma. Lemma 13. 5| (Running Gap) says that if a gap is large enough 



then the process Decay propagates it fast, even in the presence of some damage. 

Lemma 14. 2| (Bad Gap Inference) shows that (under certain conditions and in the absence of 



damage), if there is a gap at all then it is large enough in the above sense. 
The above lemmas are used to prove the Attribution Lemma. 
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Here is a summary of the roles of different delays: 
Pq: Default; 

Pi: Decay and computation; 
P2- Growth; 
Here is a summary of the rest of the proof. 

• We define those computation rules not dependent on communication with neighbor colonies. 

• Lemma 16.5 (Legality) shows that the computation terminates gracefully independently of the 
success of communication. 

• The development of colony C will be followed forward to the present in Lemma 16.7 (Present 
Attribution) . 

• Finally, the retrieval rules will be defined and the remaining part of the Computation Property 
will be proved. 
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12. Killing and creation 

The present section collects some rules, conditions and properties that will be needed in the rest 
of the construction. The material is somewhat heterogenous, but is related mostly to consistency, 
killing and creation. 

12.1. Edges. Let 

e_i =0, ei = Q - 1. 

A cell a; is a colony endcell in direction j if Addr{x) = ej (mod Q). Outer or germ cells before the 
end of their growth ((grow.end) resp. (germ_grow_end)) are said to be in their expansion period, and 
are called expansion cells. 

Suppose that cell x has no siblings in direction j. It will be called a protected edge in that direction 
if it is some legitimate boundary in that direction, in the sense described below; otherwise, it will 
be called an exposed edge. For each kind of cell listed below, the cell is defined to be protected if the 
statement listed after the colon is true; othewise, it is exposed. 

Member: Colony endcell in direction j, except if it is a doomed right endcell with < Age < 

(computc_start) ; 

Expansion: In direction j if this is the direction of expansion, except when it is a channel cell 
with Age > (idle_start); 

Non-expansion, germ: Outer colony endcell in direction j if this is the extension direction and 

the cell is not outer with Age > (germ_end) — 2Q; 
Non-expansion, outer, non-germ: Any colony endcell in direction j if this is the extension 

direction; 

In a rule, the condition that x is an exposed edge in direction j will be expressed by 

Xposcdj = Xposcdj{x) = 1. 

An exposed edge is the sign of defect, or a call to eliminate an extension of a colony or a colony; the 
decay rule will kill an edge cell if it stays exposed too long. 

Lemma 12.1. // a left exposed edge dies and its right sibling was not a colony endcell then this 
neighbor becomes a left exposed edge. The same holds if we replace left with right. 

Proof. This is obvious in most cases. One case when it is not is when the exposed edge is a left 
outer cell that is not an expansion cell. It is imaginable namely that its right neighbor is still in 
the growth stage. However, our definition of siblings requires the ages of cells to be monotonically 
nonincreasing as we move away from the originating colony, therefore this is not possible. 

The situation is similar when a doomed exposed right endcell dies. □ 

A m,ultidom,ain is either a domain or the union of some adjacent space-consistent domains meeting 
in protected colony-endcells. From the above definitions it is clear that only a cut can turn a domain 
into a multidomain. 

12.2. Killing. Generally, a cell will be "killed" by making it latent. Killing will be announced first 
by making a one-bit field 

Dying 

equal to 1. (The default value of this field is 0.) This feature will disable the cell from creating a 
live neighbor (just in the interest of continuity and ease of reasoning). So, whenever the program 
will kill a cell it will call the following subrule: its argument determines its speed. 
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subrule Die{p) { 
Dying :=p 1; 
Kind :=p Latent 
} 

The killing of a cell is almost always a corrective action, except when the definition of "protected" 
makes certain cells exposed with the intention of killing off a whole channel or colony. 

Condition 12.2 (Dooming). The only rules that change the field Doomed without killing the cell are 
the following. 

(1) The subrules Animate and Heal when they create a non-doomed cell; 

(2) At Age = (compute_start) , we set Doomed = 0; 

(3) At Age = (idle_start) we possibly set Doomed — 1. 



We call two cells partners if one of the following cases holds: 

(1) They are relatives at distance 2B and the cell between them is not their sibling; 

(2) They are adjacent and changing the age of one of them by 1 makes them siblings; 

When a cell has a partner then it may happen that one of its neighbors will be corrected in a short 
while to become a sibling, therefore the cell will b e froze n by the rule Freeze below. A frozen cell's 



age will not be advanced (seen the rule March in 11.6.2), making the neighbor's correction easier 



rule Freeze { 
cond { 

? X is exposed or has a non-dying partner 
! Frozen := 1 
?! Frozen 
}} 

Condition 12.3 (Freeze). Only the rule Freeze can change the field Frozen. (} 

12.3. Creation, birth, and arbitration, 
rule Create { 

pfor j e {-1,1} do { 
cond { 

? Kind = Vac and Creating^-' = 1 

! Kind := Latent 

}}} 

This rule is slightly different from the other ones since the site to which it is applied is vacant. 
Therefore in simulation, the cell x is not really there to "apply" this rule; it is applied implicitly by 
its creator neighbor i?_j(x). We will call this neighbor the mother of the new latent cell, 
rule Birth { 
cond { 

? Kind = Vac and Kind^ = Vac -1, 1) 

! Kind :— Latent 

}} 

This rule tries to give rise to a newborn cell if its neighbors are vacant. As the Computation Property 
shows the birth rule will not be enforced by the trajectory property. (Still, birth will be realized for 
big cells when germ cells succeed in creating a new colony.) 
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Condition 12 A. Create and Biitli are the only rules applicable to a vacant cell. () 



Condition 12.4 implies that in all cases different from the one listed in the rules Crea te or Birth, the 
site is required by the transition function to remain vacant. Case (^) of Condition 8^ (Computation 



Property) allows for the creation to be blocked by a cell whose body intersects the cell to be created. 

Let us proceed to the definition of rule Arbitrate which controls the values of the field Creating ^ . 
Consider the example of a cell x and its left nonadjacent neighbor y = x^_i.5(a::) that may want to 
create a cell in y -\- B, overlapping the body of x. Whether x will be erased will be decided not by 
whether y is stronger than x but by whether the new cell y + B would be stronger than x. 

Remark 12.5. This distinction matters when a colony wants to create an outer cell that would 
intrude into another colony. It is important in this case for the created cell to be weaker than the 
member cells of the other colony with whom it is competing for space. 

To simplify the expression of the rule, for a relation R, let 

R 

a <b 

mean "a < 6 or (a = & and R holds)" . The kind of the cell to be created must have been declared 
in field Kind^j of 'dj(x). This field is actually a function of two fields called 

Kind j. Crow, Kind j. Heal, 

which will be set by the rules Crow^step. active and Heal (see later), and is defined as 

{ Kind j. Heal if Dying — 0, Kind j .Heal ^ Latent, 
Kind j. Crow if Dying — 0, Kind j .Crow ^ Latent, 
Latent otherwise. 

Condition 12.6. The default value of Creating^ is 1. The only rule changing Creating^ is Arbitrate. 



rule Arbitrate { 

pfor j e {-1,1} do { 
cond { 

(1) ? Kind ^Latent and Kind^_:^/ > KindJ^ and Creating^' = 



it and Kind"_iy 

! Kind :— Vac 



(2) ? Kind ^ Latent and (Kindly > Kind or Kind^_:^/ > Kind) 
! Die(po) 

? Kind' ^ Vac 

(3) ! Creating^ := 
?! Creating J := 1 
}}} 

Part (H) erases a cell if another cell must be put in its place (initiated by a non-dying cell) that is 
stronger or has the same strength but is initiated from right. Part (|l]) erases a latent cell fast. This 
part will be stronger than any other rule possibly confiicting with it (which, in other words, would 
just set some fields of the cell instead of erasing it). The role of the condition Creating^-' — will 
be seen in the next lemma. According to (^), the rule turns off Creating j{x) as soon as the job of 
creating the neighbor x + jB is done. 
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Remark 12.7. The rule also shows that the default value of Creating^ is 1. Therefore even a latent 
cell creates a neighbor if nothing is in the way. This will be used for the self-organization properties. 

The following lemma shows that the rules Arbitrate and Create indeed succeed in creating a new 
cell. Here, cell x — B will create a cell at site x. Creation from the right is analogous. Let 

(12.1) T, = (k + l)T'. 

Lemma 12.8 (Creation). Assume the following, with I — [io,^o + To + 4r*].- 

(a) [x — B^x + 2B] X I is damage- free; 

(b) r](x — B, t). Dying = and 

■q{x - B,t).Kindi > 'q{y,t).KindVT]{y,t).Kind^i 

for all {y, t) in [x+, x + 2B~-] x I; 
Then r]{x, t) ^ Vac for some t in I . 

Proof. Let us assume, on the contrary, that x is vacant during all of /, and we will arrive at a 
contradiction. 

The conditions and the rule Arbitrate imply that we have rj{x — B,t). Creating ^ = 1 at some 
^1 < ^0 + To and this stays so while x is vacant. Now the rule Create requires that x become a 



cell. However, Condition B.8 (Computation Property) requires x to actually become a cell by time 
ti + 2T' only if the body of no existing cell intersects with the body of x: see case (^. Suppose 
therefore that some cell y intersects the body of x during [ti,ti + 2T*]. 

1. Suppose that y does not disappear before to + tq. 

Then part (|^) of rule Arbitrate implies that during this time. Creating _i{y + B) becomes 0, hence 
y + B loses the ability to recreate y fast. Part (|^) of the same rule makes y latent within tq time 
units. Part (|l|) then erases y within 2T' units of time. 

Any new cell y' causing a similar obstacle to creating x could arise only if y + B creates it. 
Indeed, the only other way allowed by the Computation Property is case (Q) there; however, this 
case, reserved for the possible appearence of a latent cell out of "nothing" , (indeed, out of lower- 
order germs) requires "djiy' , t) to be vacant for all j, i.e. that y' have no (adjacent or non-adjacent) 
neighbors. But, x ~ B would be such a neighbor, so y' will not appear. 

Thus, X will be created within 2T* time units after the disappearence of y. 

2. Suppose that y disappears before to + 'Ha- 
lf it does not reappear within 2T" time units then x will be created as above. Suppose therefore 
that y reappears. When it reappears we necessarily have Creating_i{'y + B) = 1. This turns 
within tq time units. After it turns 0, the rule Arbitrate erases y within 2T* units of time 
and then x will be created in 2T' time units. Cell y will not be recreated to prevent this since 
Creating _^{y + B) = for at least poT, > AT' time units. 

□ 

12.4. Animation, parents, grov^rth. A latent cell x can come to life by a rule in three ways: by 
the rules Animate, Heal, or by starting to develop as a germ cell. The subrule Animate{j,p, . . . ), 
j £ {—1,1}, gives the new cell the appropriate field values. 

The field Becoming (default value 0, reset when Animate is not applicable) is used to slow 
animation by its assignment parameter p. The condition Dying = in the rule below makes sure 
that (in absence of damage) the mother is still alive when x becomes live. The subscript po in the 
last assignments makes sure that once the decision is made to revive the cell, its age is set fast 
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enough in order that it does not stay much behind the age of the creating cell. This way, the created 
cell becomes a sibling of the creating one. 

subrule Aniniate{j,p,Fi,vi,F2,vi, . . . ) { 
cond { 

? Kind = Latent and Dying-' = and ( 

(t?2j(x) is a sibling of i?j (x) and Dying^ = Dying^^ = 0) 

or {Kind-' = Germ and Addr-' = —Q and Dying-' = and 

there is no non-germ neighbor towards —j) ) 
! cond { 

? Becoming = 

! Becoming :—p 1 

?! Ilpfor i = 1,2,... do Fi :=p„ v, } 
? ! Becoming 
}} 

The cell i?j(x) used here is called the mother cell. In case the same result could also have arisen 
using cell t^-j(x) then the mother cell is the one closer to the center of its colony. 
The following lemma is immediate from the definition of the animation rule. 

Lemma 12.9 (Animation Support). Suppose that 

(a) a cell x has just been animated at time t by a non-germ neighbor y = '&j{x) (this rule, with 
observation time t' being a possible explanation for its becoming live); 

(b) x+[-3B,4:B] xt+[-3T',0] is damage-free; 

(c) there is no colony-boundary between y and, its sibling required by the rule ; 
Then y and its two siblings survive until after t. 

Proof. The animation requires a sibling for y with both y and the sibling non-dying. Due to the 
minimum delay po in dying which they did not even begin, these cells remain live till after t. Since 
there is no colony boundary between them, a cut will not break the sibling relation of these cells 
either. □ 

Remember that whether a cell has kind Channel or Growth can be determined from its age. 
Therefore it is sufiicient to have one value 

Extj 

in place of Gro-wthj and Channel j. Rules Extend and Grow both rely on the following subrules. 
subrule Grow^tep.active{j) { 
cond { 

? Kind = Germ and j points away from the colony center 

! Kind j. Grow := Germ 

? (Addr = ej or Kind = Extj) and Sib{-j) 
! Kindj . Grow : = Extj 
? ! Kindj. Grow := Latent 
}} 

subrule Grow-step.passive{j) { 
cond { 

? Kind = Latent and Kindj-' G {Germ, Extj} 

- " 

and Kindj > Kind_j 

! if possible, make x consistent with -d-j (x) using 
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Animate{—j,p2, Kind, Growthj, . . . ) 
}} 

The rule Extend serves to extend the channel in direction j during the computation time of the 
colony. 

rule Extend { 

pf or j = —1,1 do { 
cond { 

? Age e [0, (compute_start) — 1] 

! Grow_step.active{j) 

? Age~^ e [0, (compute_start) — 1] 

! Grow^tep.passive{j) 

}}} 

The rule Grow depends on the fields Growing j. The computation rule (to be defined below) turns 
this field to 1 in all cells of the colony iff the field Creating j in the big cell represented by the colony 
has value 1. Otherwise, it will be in all cells. The healing rule, to be given later, keeps this locally 
maintained field constant throughout the extended colony. 

rule Grow { 

pf or j = —1,1 do { 
cond { 

? Age e [(grow^tart), (grow_end) — 1] and Growing j — 1 
! Grow_step.active{j) 

? Age^-' G [(grow_start), (grow_end) — 1] Growing"-' — 1 

! Grow-Step.passive{j) 

}}} 

For germs, the growth rule is similar. However, growth will proceed always to the right and the time 
intervals in which it occurs will be defined later, in the germ program. 

12.5. Healing. Let Fi := vi,... ,Fk = Vk be an assignment changing some fields of x. This 
assignment is an internal correction if the following conditions hold: 

(a) For j e { — 1, 1}, we have Dying-' = 0. Also, Frozen^ — 1 unless ^^^(x) is a sibling of x. 

(b) After the assignment, but not before, x and its two neighbors form a domain in which the 
following holds: for each locally maintained field F, if Age is not in the update interval of F (as 



defined in Subsection 11.4) then F = for j G { — 1,1}. 
(c) In each direction j, the domain, with constant values for the locally maintained fields, continues 
to the second neighbor unless the first neighbor is a protected colony endcell towards j. 

If X is latent and one of its neighbors is an endcell of its colony protected in the direction away 
from X then the internal correction will be called a near- end- correction. An assignment to make a 
non-germ cell is an end- correction to the right if the following holds: 

(a) Dying~^ = 0. Also, Frozen~^ = 1 unless ??_i(x) is a sibling of x. 

(b) After the assignment, but not before, the following holds: x is the right-protected right endcell 
of its colony with i9_i(x) in its domain, with Age — Age~ , and for each locally maintained 



field F, if Age is not in the update interval of F (as defined in Subsection 11.4 ) then F — F~^; 

(c) The domain, with constant values for the locally maintained fields, continues to ^9_2(x). 

(d) If X is a right outer cell then Age > (grow_end); 

End-corrections in the left direction are defined similarly. Note that end-correction does not create 
a germ cell. Each of the corrections above is called mild if it can be achieved by only changing some 
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locally maintained fields different from Age and Addr. Internal corrections are called weak if (g) is 
not required. Let us denote by 

Int_corr{x) — 1, End_corrj{x) = 1 

the fact that an internal correction is possible in x or that an end-correction in direction j is possible 
in X. The fact that a weak internal correction is possible will be denoted by 

Int-Corr {x) — 1. 

We need the notion of weak internal correction since cell x cannot check Int_corr(z?_i(x)) directly, 
only Int_corr'(i?_i(x)). 

rule Heal { 
cond { 

? Int_corr(yi) = 1 and 
if Ji3fc.corr'(i?_i(x)) = 1 

then that correction would result in a weaker cell ■d-i{x) 
! cond { 

? the correction is mild 
! carry it out 
? Kind ^ Latent 
! Dieipo) 

? ! correct using Animate {j,pQ, . . . ) for some j G { — 1, 1} } 

(1) ? 3jG{-l,l} End_corrj{y:) = 1 and Int_corr'(i9_j(x)) = and 

_ ■ j=-i 

X is no endcell in direction —j and End^corr- j (x) = and Kindj ^ > Kind^_j 
! cond { 

? the correction is mild 
! carry it out 
? Kind ^ Latent 
! Die(po) 

? ! correct using Animate{j,po, . . . ) } 
? 3j £ {— 1, 1} there is an end-correction in direction j in i?j(x) (then j is unique) 

( 2 ) ! Kindj . Heal : = Kind ; 

?! pfor j e {-1, 1} do Kindj.Heal := Latent; 
}} 

Part (0) uses the fact that our cells have a reach greater than 1, seeing their second neighbors. If 
an internal correction will be carried out then of the two neighbors of x, the one closer to the center 
of the colony is called the mother of x while the one farther from the center is called the father. 

rule Purge { 
cond { 

? X is isolated and is not an endcell of a near-end-correction 

! Die{po) 

}} 

Condition 12.10 (Animation). 

(a) The only rules creating a live cell are Animate and Birth; 

(b) Animate will be applied only by Heal or Grow^tep, and it will never create an exposed cell; 

(c) The birth rule will be applied to cells with no live neighbors; 

<> 
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Condition 12.11 (Killing). 

(a) The rule Dic{p) is invoked always with p > po', 

(b) A cell can only be made vacant by the rule Arbitrate; 

(c) Only the following rules can kill a cell: Heal, Arbitrate, Decay, Purge; 

(d) A non-exposed edge can be killed only if a neighbor has Kindj ^ Latent. Setting Kindj ^ Latent 
happens only in the growth rules and in end-healing; 



12.6. Continuity. Our terminology turns out to be incestuous: a child cell can only be created if 
it also becomes a sibling. 

Lemma 12.12 (Parent). Suppose {x,u) is not in any extended damage rectangle, and x becomes 
animated. Then there is a j { — 1, 1} and t' G [u — 3T*+,u — T',/2] with the following properties. 

(a) Some rule is applicable at time t' invoking the animation of x with mother x + jB. 

(b) X + jB has a state at time t' that makes it a sibling of {x, u). If the internal correction case of 
the healing rule applies then {x — jB, t') is a father of (x, u); 

(c) For any time t E [u — T',u\, if x and its mother (resp. father) are not in any extended damage 
rectangle during [t' ,t] then they are siblings at t; 

(d) If X is not affected by the damage via neighbors at time u then we can choose t' > u — T' . Also, 
t' can be chosen anywhere before u — T* if this is also before the time projection of the damage 
rectangle. 

Proof. Since x is not affected immediately by damage, legality implies that the change is the delayed 
result of the application of Animate. 
1. Let us prove (^) and (|^) first. 

Suppose first that x is not affected by the damage via neighbors at time u. Then at the observation 

time t' corresponding to the switch {x,u), we have the situation described by (J^). 

Suppose now that x is affected by the damage via neighbors at time u. Then, according to 



Lemma ( 11.1 ), it is not affected via neighbors during [u — 3T',u — 2T']. Let t be a switching 
time of X in this interval (in a moment, we will see that t exists.) Then at the observation time t' 
corresponding to the switch {x, t), we have the situation described by (^. Indeed, there are at most 



4A steps of X between t and u; but the delay parameter of Animate is at least po, so since (11.4) 
implies 4 A < poj the observation time t' must have occurred during the wait. 

2. Let us prove (0). 



According to Condition 12. IC (Animation), the applied rule was either Heal or Grow_step. In the 
healing case, the parents are frozen and non-dying, therefore they will not change their age for a 
while yet in any way but healing. 

In the growth case, the age of the child is made equal to the age of the mother at a not much earlier 
time t" since once animation has been decided the assignment during animation happens fast (with 
delay Pq). Therefore the mother has time for at most one increase (with delay P2) of age in [t" — , u], 
and this will not break the sibling relation. The mother (and father) does not die since the rule Die 



would have announced this at least pqT, time units earlier (see Condition 12.11) via the field Dying 



and this would have turned off the animation or healing of x ((11.4) implies pqT, > 2T'). 

□ 

Lemma 12.13 (Glue). Suppose that the adjacent cells x,x + B are siblings at time to, and the 
damage does not intersect the rectangle 

{x,to) + [-2B,3B] X [-4T',T']. 
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Suppose also that at the next moment that is a switching time of one of them, this is a switching 
time of X and this breaks the sibling relation. Then we have the following possibilities: 

(1) a cut; 

(2) X ~ B was not a sibling of x at the last observation time of x and the switch kills x by Heal, 
Decay or Arbitrate; 

Proof. 

1 . If a cell X breaks a sibling relation by a rule then one of the cases listed in the statement of the 
lemma holds. 



This follows from the definition of siblings, Conditions 11.4 (Address and Age), 12.11 (Killing) 
and the healing rule. 

We will show that if neither of the possibilities listed in statement of the lemma holds then the cells 
remain siblings. 

2. Suppose that x ov x + B were animated at some time in [to — T*,to]; without loss of generality, 
suppose the latest such time was ti and the cell was x + B. 

Then x + B will be without an age change for at least poT, time units which, due to the fact 



that (11.4) implies po > 4A, is longer than the whole period under consideration. If x also 
underwent animation during this interval then the same is true for it, hence the two cells remain 
siblings. Suppose therefore that x has been live during [to — T'^to]. The rule Animate i mplies 



that x + B is not a germ unless x is one of the parents. If a; is a parent of x + B then Lemma 12.12 



(Parent) implies that the two cells remain siblings for at least T* time units; after this, both cells 



have seen each other as siblings and therefore Condition |11.4| (Address and Age) shows that they 
remain siblings until a cut or a death. 

Suppose that x is not a parent of x + B. If it has changed its age within the last AT* time units 
then it will not change the age for a long time after, and the two cells remain siblings. If it has not 
changed its age within this time then for at least 2T* time units before the observation time before 



the animation, it already is a partner of the mother x + B. The rule Freeze of Subsection 12.1 
implies then that x is frozen which keeps x and x + B siblings. 
. Suppose now that both cells have been live during [to — T* , to] ■ 

If X changes its age within this time then it will not change its age soon and therefore remains a 
sibling. Suppose therefore that x does not change its age during this time. If x + B was a sibling 
all the time during [to — T' , to] then x sees that x + B is a sibling and will not break the sibling 
relation. Suppose therefore that x + B changes its age within this interval and becomes a sibling 
of X this way. Then x had ample time before this age to observe that a; + _B is a partner. Therefore 
X is frozen and will not change its age at the next switch. 

□ 



Remark 12.14. The lemma essentially also holds if we exchange left for right and x + B for x — B, 
with the following modification: 

In this case there is yet another possibility for x to break the sibling relation by a rule: namely, 
when X is a doomed right end membercell of its colony turning to Age — 1 . Indeed, case d) of the 
definition of siblings says x is not a sibling of its left neighbor anymore if this happens. (} 

Lemma 12.15 (Exposing). An edge turns into an exposed one by a rule only in the following cases: 

(1) doomed right end membercell of its colony turning to Age = 1; 

(2) channel cell, Age = (idle_start); 

(3) outer germ, Age = (germ_end) — 2Q; 
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(4) growth, non-end. Age = (grow.end); 

(5) germ cell. Age — (germ_grow_end), except when it is an outer edge of an outer colony endcell; 



Proof. Direct consequence of the definition of siblings and exposed edges and Conditions 
dress and Age), f2.11 (Killing). 



11.4 



(Ad- 
□ 
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13. Gaps 

The main lemma of this section is the Running Gap Lemma, saying that if a sufficiently large gap 
is found in a colony then this gap will not be closed, but will sweep through it, essentially eliminating 
a partial colony, and serving as a preparation to the Attribution Lemma of the next section. We 
start by collecting here some constants and inequalities, for later reference. For clarity, we omit the 
notation [-J for integer part. Recall that by definition. End — (grow_start) and End — (grow_end) are 



constant. We use a constant Ki that can be computed from the program, see (16.7) below. Recall 



the aennition 


= (Pi + IjT' m (12.1). 






Pq 


= 4A + 1, 


(13.1) 


Pi 


= 4Apo- 


(13.2) 


P2 


= 2Api, 


(13.3) 


(spht_t) 


= 14ri +r.. 


(13.4) 


Q 


> 250 > 12(split_t)/ri, 


(13.5) 


(compute_start) 


= 3Qp2/pi = 6QA, 


(13.6) 


(compute_time) 


= KiQCap/w, 


(13.7) 


(idle-Start) 


= (compute_start) + (compute_time) 


(13.8) 


(synch_start_lb) 


= (idle_start)(l + A), 


(13.9) 


U 


> ((synch_startJb) + Q)pi, 


(13.10) 


(end_period) 


= 6(3piA, 


(13.11) 


(grow_start) 


= End — (end_period) + AQ, 


(13.12) 


(grow_end) 


= (grow_start) + 6QX, 


(13.13) 


(crit) 


= 3Qri, 


(13.14) 


(germ_grow_end) 


= 12QpiA, 


(13.15) 


(germ_end) 


= (germ_grow_end) + (end_period). 



Inequality ( 13.9] ) holds in view of ( 9.161 ), if Rq is large enough. Before, (synch_start Jb) is a lower 



bound on the value of Age when End will be computed. Similarly, (synch_start_lb) + (end_period) 
is a lower bound on End. 

13.1. Paths. Suppose that t < u, cell x is live at t and is not in any extended damage rectangle 
during the interval [i, u], and there are no switching times in [t+,u—]. Then we say that {x,t) is 
connected by a vertical link to {x,u).Ii one end of a vertical link is not a switching time then the 
link is called short.li cells x,x + B are siblings not in any extended damage rectangle at time t or t— 
then the points (a;, t), {x + t) are said to be connected by a horizontal link (of size l).Also the pair 
(x, t),{x + 2B, t) is said to be connected by a double horizontal link if there is a near-end-correction 



for {x + B, t).If (y, t') is, according to case (|b|) of Lemma 12.12 (Parent) a mother or father of (x, u), 
we will say that the point (y, t') is connected by a parental (maternal or paternal) link to point 
{x,u).By this lemma (under the appropriate damage- free condition), the parent survives until the 
birth of the child, and therefore the parental link can be replaced by a horizontal link and some 
vertical links. A link is a link of one of these kinds. A link is steep or, equivalently, slow if it is a 
non-short vertical link or a parental link. (Since time is the second coordinate, steepness of a line is 
synonymous to slowness of the movement of a point along it.) A sequence (xq, tg), . . . , (a;„, t„) with 
t'i < tj+i such that subsequent points are connected by links, is called a path. A forward (n, k)-path 
is a path of length n whose number non-steep links is at most k. The adjective "forward" will be 
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omitted when it is obvious from the context. A (n,0)-path is steep, or slow. A backward path is the 
reversed reading of a forward path, backward in time. Notice that a point (xi,ti) on a path can 
actually be dead, if it has just died: indeed, it can be connected e.g. to {xi+i,ti+i) by a horizontal 
link such that tt = i^+i and Xi^Xi^i-i are siblings at i^ — . 
For a path P = (a;o,io), . . . , {xn,tn) and t G [tojin], let 

P{t) 

be Xi with the smallest i such that t G [ii,ti_|_i]. 

The following statement follows immediately from the definition of paths. 

Lemma 13.1. For the time projection d of an (n, k)-path we have 

d> {n-k)T,/2. 

The following lemma says that if two paths cross then they intersect. 

Lemma 13.2 (Crossing). Let (xi, si), . . . , {xm, s„i) and (yi, ti), . . . , (?/„, i„) be two paths with si = 
ti, Srn — tn, Xi < j/i , x„i > Hn- Then there are i,j such that Xi — yj and either tj G [si,Si+i] or 
Si G [tj, ij+i] . 

Proof. Parental links can always be replaced with horizontal and vertical links. Horizontal links of 
size 2 jump over a latent cell only. So, paths cannot jump across each other. □ 

According to the Parent Lemma, a steep path can be continued backward in time until it hits 
some extended damage rectangle. Moreover, occasionally we have a choice between continuing to 
the mother or to the father. Let 

(13.16) (wake) = tq + 2T'. 

If [ao,ai— ] X [uo+,ui— ] is an extended damage rectangle then let 

(13.17) U2 ~ uq + (wake). 
The rectangle 

(13.18) [aQ,ai-] X [uo+,U2-] 

will be called the wake of the damage rectangle in question. The lemma below says that unless a 
path started backward in the wake of a damage rectangle, it can be diverted and continued back 
past a damage rectangle. 



Lemma 13.3 (Ancestor). Let {xo,to) be a live point not in ( 13.18 ), with xq in [oq — QB/A,aa + 
QB/A—] and to in [uq, mq + 7»*/2] . 

Then, there is a path going backward from (xQ,to) and ending either in uq or in a birth. It is an 
(n,2)-path for some n with at most 1 horizontal link. In constructing the path backwards, we are 
free to choose between maternal and paternal links at all times but possibly once, when moving out 
of [ao,ai]. These times, as well as the horizontal link, may only occur during [uq+,U2—]. 

Proof. We call the path to be constructed a desired path. Let us start constructing a steep path 
Co, . . . , c„ with 

— {Xi , ti ) 

backward from (xQ,tQ). If we get to uq then we are done, otherwise, we stop just before we would 
hit ui + T* — , with Ck being the last element. Then tk < ui -\- 2T* and x — Xk & [oq, ai]. Indeed, if 
this is not so then we could continue the path either by a vertical or by a parental link. The vertical 
link would be shorter than than T* , and the parental link would lead to a damage- free cell, so either 
of them would be allowed. 
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Let US now go back on the path for i = k,k — 1, . . . until the first i (counting from k) such that 
either c, is a parent of Ci_i or Xi — x can have a horizontal link at some time during [ti, ti-i]. There 
will be such an i, since otherwise the cell x would be isolated throughout [wi + 2T*,U2] with no 
near-end-correction in a neighbor, and the rule Purge would kill it by the time U2- 

Suppose that Ci is not a parent: then let yo = x and let wq be the earliest time in [ti,ti-i] when 
X has a sibling yi ^ x. Let Wi = wq. Suppose that Ci is a mother: then it has a sibling yi ^ x. Let 
wi = ti in this case. Suppose that Cj is a father: then let yi ^ x he the corresponding mother with 

Wi = ti. 

Let Pi be the part of the original path until Ci_i, and P2 the new part ending in {yi,wi). Let 
us build a steep path {yj,Wj), j = 1,2, . . . , backwards until either wj < uq or yj = x and let P3 
be the part of this path ending with If Wj < uq then Pi, Pi, P3 combine to a desired 

path, suppose therefore that Wj > Uq, hence yj = x. Then {x,Wj) is a parent of {yj-i,Wj-i), hence 
by the Parent Lemma, a; is a sibling of yj-i at time tj-i. By definition, if t is the first time after tk 
when X has a sibhng then t > ti; hence Wj-i < tk < u + 2T* . By the Parent Lemma, {x,uq — T*) 
is also parent of {wj-i,tj-i) (by the same animation): choosing this as {yj,Wj), we are done. □ 

13.2. Running gaps. The rule Decay attempts to achieve that if some "relevant gap" was not 
closed in reasonable time then it becomes wider. 

rule Decay { 
cond { 

? 3je{-l,l}Xposed(i) 
! Die(pi) 

}} 

Remark 13.4. Most killing will be done by this rule. The rule Purge is important only when the 
damage erases a colony endcell and creates an isolated cell, intersecting its former body. The decay 
rule (in this simple form) would take too long to eliminate the new cell which could in the meantime 
widen the gap (and make it unhealable). ■()■ 

Consider an interval G = [l+, r— ] where < r — Ms divisible by B. Assume that if the wake of 
a damage rectangle with space projection [ao,ai— ] intersects G then I < qq < ai < r. We call G 
a gap with right-age n if n is the smallest number k such that every cell in G space-consistent with 
r and not in the wake of a damage rectangle is a germ cell with age < k. The right age is infinite 
if there is no such number k (these cases will not be relevant). The size of G is r — I — B. If G is 
contained in the colony of r then it is called an interior gap. As we see, G is assumed to be a gap 
only in the set of cells that arc space-consistent with r and arc not young germ cells. (It is probably 
not important that we exclude only cells space-consistent with r.) 

Suppose that in a time interval [vo, vi], the gap G{t) = [l{t)+, r{t)—] is defined for all t, in such a 
way that all cells r{t) are space-consistent with each other, and for all ti, t2 with \t2 — ti| < 3T*, we 
have r{t2) — l{ti) > B. Then G{t) is called a (right) gap path, and the right age of the gap path is 
the maximum of the right ages of the gaps in it. By this definition, if a path space-consistent with 
r{t) has the same time projection [fo, vi] as the right gap path G{t) and has no germ cells younger 
than the right age of the gap path then it cannot cross G{t) since no parental link can jump through 
it. Though the wake of a damage rectangle in G{t) may contain some cells that are not young germ 
cells these do not live long enough to become parents (due to the wait of animation) and therefore 
also cannot assist in the jumping of a path. 

The lemma below says that the Decay rule causes a large enough gap to move right rather fast. 
The gap is assumed to be connected, via a series of horizontal links, to a forward path. This excludes 
irregular and unimportant cases when the gap would have to travel through all kind of debris. 
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Figure 16. Running Gap Lemma 



Lemma 13.5 (Running Gap). Let Pi — {xq,vo), ■ ■ ■ , {xn, Vn) be a forward path with at most one 
double horizontal link, let L,k be positive integers with 

L<3Q, 

k < (synch_start_lb) + (end_period) — 5LpiX. 

Assume the following: 

(a) (x„, Vn) is not a germ cell younger than < k + lOLXpi; 
(Jo) (jjotVo) is to the left of {xq,vo) in the same domain; 

(c) if Hq is a left outer cell then Pi is in the same colony; 

(d) No damage rectangle affects [yo+, Xq—] x {vq}, even via neighbors; 

(e) {yojVo) is a left exposed edge (see Subsection 12.1 ); 

(f) yo has been the right end of an interior gap of right-age k and size > 5B during [vq — T',vo]; 

(g) Vn -vo < 2Lti; 

Then during [vQ,Vn], a right gap path G{t) = [l{t)-\-,r(t)—] can be defined, with r(wo) = 2/Oj 



(13.19) 



(r(w„) - yo)/B > (w„ - vq - (wake))/Ti - 4, 



r{t) > yo — B and the right ageof the gap path is < k + bLXpi. If no damage rectangle occurs on the 
whole interval during the time considered then the right-hand side of ( 13.19| ) can be replaced with 

[{Vn - Vo)/ti\. 
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Before proving this lemma, let us prove a corollary saying that if Pi is long there is no large young 
gap next to its beginning (say, on the left). 

Corollary 13.6 (Running Gap). Assume the conditions of the Running Gap Lemma, with L — Q, 
and assume also that the path Pi remains in the colony of yo . Then we have 

(13.20) Vn-Vo<QTi. 

Proof. It is easy to see that the length of the path is at most 5LXpi. By the definition of the gap 
path G{t) above, path Pi starts on its right. Since age varies by at most 1 per link along a path 
(except for a double link) and since (a;„, w„) is not a germ cell y oung er than fc+ lOLXpi, no cell on Pi 
is a germ cell with age < k + 5LXpi, while according to Lemma 13.5 , all cells in G{t) space-consistent 
with r{t) are g erm ce lls with such age. Therefore Pi never crosses the gap path from right to left. 
The inequality ( 13. 1£ ) gives a lower bound on how fast r{t) moves right. Since Pi stays in the colony 
of yo, 

{vn - vq - (wake))/Ti - 3 < Q - 5, 

Vn -vq < Ti{Q - 2) + (wake) < tiQ 
(see ( |l3.l[) and ( 13.16 ) for the last inequality). □ 



Proof of Lemma 13.5. Let r{vo) — yo, and let l{vo) be the leftmost cell such that the gap 
[l{vo)+,r{vo)—] has right-age < k. 

1. Let ti > vo- Assume that for alH < ii, a gap path G{t) is defined with the desired properties and 
in such a way that (r{ti), ti) is not a germ cell younger than k + lOPpiA and is not in the wake 
of a damage rectangle. Then r{ti) is a edge space-consistent with r(<o) that is either exposed or 
is to become exposed in one step, in one of the cases of Lemma I2.15| (Exposing). This last case 
occurs only if there is a damage rectangle during [vo, ti]. 

Proof. Since (r(ti),ti) is not in the wake of a damage rectangle we can build a backward 
(m, 2)-path 

P2 = {{zo,Wo) = (r(ti),ii), . . . , {Zm,Wm)) 

according to the Ancestor Lemma. Lemma |13.1| and the bound (^) gives 

m < 2 + 2{vn - vo)/T, < 2 + ALpiX < bLXpi. 

The backward path P2 ends in at time wq. Indeed, if it ended in a birth then (r{ti),ti) would 
be a germ cell younger than bLpiX contrary to the assumption. P2 does not cross the gap since 
otherwise {r{ti), ti) would be a germ cell younger than k -I- bLXpi + bLXpi which was excluded. 
Therefore defining x = P 2(fo) we have x > yo. Without loss of generality, we can suppose x < xq. 
Otherwise, Lemma 13.2 (Crossing) implies that path P2 crosses Pi and we can switch from P2 to 
Pi at the meeting point. Thus, x is on the interval [yo, xo], aligned with xo. 

Combine P2 and the horizontal path from (x, vo) to (yo , ^0) into a single chain of links connecting 
siblings. This chain has at most 3Q horizontal links from the left edge {yo, vo) to {x,vo), and then 
at most 2 + 2(u„ — vo)/T, links on P2, giving fewer than 

3Q-l-2(v„-wo)/r. + 2 

links. The assumption that the gap path satisfies the requirements of the Lemma implies r{t) > 
yo ~ B, hence P2 is to the right of the left end of the colony of yo- 

In the number of steps available, a protected left edge cannot occur on such a path unless yo 
is a left outer cell and the path leaves the colony of ?/o on the right. Indeed, suppose first that 
{yo,vo) is a member or right outer cell. For a path of ancestors starting from such a cell to turn 
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into a left outer cell this path would first have to walk right to participate in the creation of a 
colony and then walk back through the growth process from that colony, which is impossible due 



to (13 



Suppose now that {i/q, vq) is a (exposed) left outer cell. The age of cells on the path is locally 
monotonically nondecreasing, except when it crosses into another work period. Indeed, since the 
age of siblings is required to be monotonically nondecreasing towards the originating colony, it 
is nondecreasing on the horizontal part of the path; it is also nondecreasing on the part P2 con- 
structed by the Ancestor Lemma, except on the horizontal link allowed by that lemma. Therefore 
the age of {r{t),t) cannot be smaller by more than 1. If the age is not smaller at all then {r{t),t) 
is also exp osed. If decreasing the age by 1 makes it protected then we must have one of the cases 
in Lemma |l2.15 . 



Suppose that {yo,VQ) is a germ cell. Left germ cells inside a colony, as well as member cells 
they can turn into on the path P2 are always exposed. We should not rely on this asymmetry 
in the definition of "exposed" for germs, however, since we want to apply the lemma also when 
changing left to right. Let us assume therefore for a moment that left and right is interchanged 
in the definition of "exposed" for germs. Then a germ cell that is a left edge is exposed to the left 
only if its age is > (germ_grow_end). In this case, the same argument works as for left outer cells 
above. 

, Consider a time interval [/o, /i], assume that the gap G{t) with the desired properties was defined 
up to time /o, where G{fo) has size > 2B and right-age < k. Assume that for t e [/o,/i], no 
wake of a damage rectangle intersects the area where we define the path further. 

Then the gap path G{t) can be defined further in [/q, /i] in such a way that G{t) has right-age 
<k + 2{t- fo)/T, + 1 and size 

> r{fo) ~ l{fo) -B+it- fo)B/{p2T,), 

with 

{r{t)~r{h))/B>[{t- f^)/T,\-l. 



Proof . Let us define G{t) as follows. Suppose that it was defined up to time ti and let ^2 be the 
next time that is a switching time of either l{ti) + B or r{t). We distinguish the following cases. 

(1) t2 is a switch of /(ii) + B. If the switch is an animation resulting from Grow creating a sibling 
of l{t{) then l{t2) = l{ti) + B, else l{t2) = Hh). 

(2) t2 is a switch of r{t). If r{ti) dies then r{t2) is the closest cell to the right of r{ti) that is not 
a germ cell younger than k + 2(^2 — fo)/T, -I- 1 or fc + lOLApi, else r{t2) — r{ti). 

(3) In all other cases, we leave G{t) unchanged. 

2.1. G(/i) has right-age < fc + 2(/i - fo)/T, + 1. 

Proof. Let zi be a live cell in G{fi) space-consistent with r{fi), and let us build a sequence 
(zi, wi), (z2, W2), • ■ • with fi — wi > W2 > ■ ■ ■ as follows. Without loss of generality, assume 
that wi is a switching time of zi. If zi is live at wi— then Z2 = zi and 'W2 is the previous 
switching time of zi. Otherwise, either (zi,wi) is a newborn germ cell, in which case the 
sequence ends at (zi, wi), or it has a parent {x, u). 

If X G G{u) then Z2 = x, W2 ^ u. Let us show that otherwise, zi was created by the internal 
correction part of the healing rule. If a: < l{u) then it could not have created zi by the growth 
rule since by the definition of l(t), this would have increased l{t), bringing {zi,wi) outside the 
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gap. If a; > r(u) then it could not have animated zi by the growth rule since r{u) is an exposed 
edge. The end-correcting part of the rule Heal could not be involved. Indeed, the left end in 
case could only be the left end of the right neighbor colony of yo. In this case, r(u) would be 
a member cell and hence t/o would be a left outer cell. For that case, however, we assumed in 
condition (^) of the lemma that the path (hence also the gap path which is to its left) stays in 
the colony of t/Q. (This is the part of the Gap Lemma where we use the fact that we upper-bound 
only the right-age, i.e. the age of cells space-consistent with r{u): thus, the end-healing of some 
other colony inconsistent with r(u) can be ignored.) 

If the internal correction case of healing created zi, let {z2, W2) be the father of (zi, wi), this will 
be inside the gap. By the parent construction, W2 < wi — T, and Age(zi, wi) < Age{z2, W2) + 1. 
We see that the sequence (z^, Wi) steps back at least T,/2 time units in every step and the age 
of (ziyWi) can decrease by at most 1 in each such step. Since it can only end in a birth or a 
germ cell in G(/o) this and (0) proves the age bound. 



2.2. 



(a) Edge r{t) does not move left during [/o, /i]. During the same period, l{t) moves right at 
most once. 

(b) For aU n > with i = /o + ^tq < /i, we have r{t) > r{ti) + (n - l)B. 



Proof . Let us prove (pj) first. By property 2.1, the gap can contain only latent and germ cells. 



The size of G{t) does not allow Heal to decrease it: indeed, it follows just as as in 2.1 above 
that end-healing cannot operate. Since G{t) is a left gap, the rule Grow can decrease it only 
on the left. After one such decrease, the size is still > B. The next application of Grow is away 
by a waiting period of length p2Tt. 

Let us prove (^). Since the conditions of |l| are satisfied, y = r(/o) is an exposed left edge or 



is a cell whose age is just one step before the applicability of one of the cases in Lemma 12.15 
(Exposing). If y is exposed then the rule Decay kills it within ti time units. In the other case, 
nothing prevents the age of y to increase within ti time units. From now on, the rule Decay 
applies. 



To conclude, note that according to 2^ and ( 13. 2| ), every P2T, time units, the left edge l{t) 



moves at most one cell width to the right but the right edge r{t) moves at least two cell widths. 
Since the size started from > 2B, it will remain > B. This proves the lower bounds on the size of 
G{t) and on r{t). 

, Let [ao,ai— ] x [^0+7^2^] be the wake of the damage rectangle and assume that uq G [vQ,Vn\. 
Assume also that G(uo) has size > 45 and right-age < fc at time uq. 

Then the gap path G(t) with the desired properties can be defined for t £ [uo+, U2— ]; moreover, 
G{u2) has size > 2B and right-age < k + 2(wake)/T, -I- 1. 

We also have r{t) > r{uo) — 2B for all t e [uo+, U2—]. 
Proof. Let k{t) ^k + 2{t-UQ)/T, + l. 

3.1. Assume that the interval [ao,ai] is closer to l{uo) than to r(uo). 
Then r{t) will be defined just as in the damage-free case. 
3.1.1. Let t G [?io+,U2— ]. 
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Then we set l{t) to be the first site from the left that, at time uq, is > l{uo) V ai and is 
ahgned with l{uo)- This defines a gap G{t) of size > B for this time interval. This gap 
will not decrease during [wo+,U2— ] since it is too large for Heal and Grow the damage has 



insufficient time to trigger Grow. So, the age bound reasoning of point 2.1 above applies and 
the right-age of G{t) will be < k{t). 

3.1.2. Let t = U2 and ao < 1{uq) + B. 

Then oi < Z(uo) + 2_B, and we set l{u2) to be the greatest cell < 1{uq) V ai and aligned with 
1{uq). Then l{u2) < 1{uq) + 2B, hence G(u2) has size > 2B. Again, the reasoning of point ^ 
above applies to show that G{t) has age < k{t). If l{u2) > l{uo) then no growth animation 
could have been pending in l{u2) from time before uq and this completes the proof of the 
claim in ^. 

3.1.3. Let t — U2 and ao > /(uo) + B. 

Then we define l{ u2) — 1{uq) + B. At most one cell can be added to l{uo) by growth during 
[uq, U2~]. Lemma |l3.3| (Ancestor) shows that any live cell in [oq, ai— ] can be traced back to 
a birth within G(t) or to a time in G(t) before uq and therefore the same reasoning as for the 
other cases gives that G{t) has right-age < k{t). 

3.2. Assume now that [ao,ai] is closer to r{uo) than to l{uo). 



This case is similar to case ( p.l[ ), so we point out only the differences. Now, the gap on the left 
of flo may decrease by one during [uq+,U2—], to size > 0, if a growth step occurs on the left. 
For t = U2, now the cases we distinguish are: 

3.2.1. Assume ai > ^(wo)- 

Then ao > r{uo) — B. We set r{u2) to be the smallest cell > r{uo) A ao and ahgned with 
l{uo). Then r{u2) > r{uo) — B, hence G{u2) has size > 2B, even if a growth step occurred at 
l{t) during [uo+,U2-]. 

3.2.2. Now assume oi < r{uQ). 

Then we define r[u2) — ''"{uq). Since no growth could have occurred on the right-hand side, 
by applying the earlier reasoning, we will find that G{t) has right age < k{t). 
Let us construct G{t) for t S [woji'n]- In the space-time rectang le con sidered, at most one damage 



rectangle occurs. Indeed, Vn — vq <T,* /2 follows from (y) and (|13.10| ). 

Let fo = vo and let /i be the supremum of the those t > fo until which the damage-free construction 
in the proof of || is applicable: thus, if there is no damage involved then /i = Vn, else /i = uq. 
Applying this construction, we get to /i with gap size > 4i?. Applying knowing there is no 
damage before, we find the age upper bound k + 2(t — vq)/T, + 1 and the lower bound \{t — wo)/tiJ 
on {r{t) — yo)/B. If fi=uo< vq then we apply the construction of ^ to get to /g = A U2. Now, 
the gap has age upper bound k + 2{t — vo) /T, + 2, and lower bound {t — vq — (wake))/Ti — 2 on 
{rit) - yo)/B. 

Let now /q take the role of fo and repeat the damage-free step to f[ = w„. Now, we have age upper 
bound k + 2{t — vq)/T, + 3 and lower bound {t ~ vq — (wake))/ri — 4 on {r{t) — yo)/B. □ 
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14. Attribution and progress 

14.1. Non-damage gaps are large. A right (as opposed to left, not as opposed to wrong) bad 
gap is a gap of right-age < 2(split_t)/r, and size > 5B. The Bad Gap Opening Lemma says that if 
a left exposed edge persists too long (while possibly moving) then a right bad gap develops on its 
left. This lemma will be used to prove the Bad Gap Inference Lemma, saying that, under certain 
conditions, a left exposed edge automatically has a right bad gap next to it. 

Let us call two siblings strong if either they have been siblings for at least 2T* hours or one cell is 
a parent of the other. A cell is called e.g. a weak left exposed edge if it has no strong left sibling and if 
it would be a left exposed edge in case it had no left sibling. A sequence Ri = (jjq, vq), . . . , (?/,«, Wm) 
of cell-time pairs will be called a left boundary path if it has the following properties: 

(a) yi is a weak left exposed edge during [ti+,ti^i~]; 

(b) cell yi-i dies at time ti and is a strong sibling of y^ = -I- _B at time ti — ; 

Lemma 14.1 (Bad Gap Opening). Let C be a colony and let Ri — (yo,wo), . . . , {yrmVm) be a left 
boundary path in C. Assume that the damage rectangle does not intersect [f\^yi — B,\J^yi -\- B] x 
[vQ,Vm]- Then yi has at most pi + 1 switching times during [vi,vm^i]. If m > 7 then at time Vm — , 
there is a right bad gap on the left of Ri (t) . The same statement holds if we interchange left and 
right. 

Proof. Let us show that {yi,t) is a left exposed edge for t in [vi + 2T*+, Vi+i — 2T*~]. 

Indeed, yi can be a weak left exposed edge that is not a left exposed edge only if it has a left 
sibling that is not strong. Now, if yi does not have a left sibling and it gets one then it follows from 



Lemma |12.13| (Glue) (and the exclusion of cut, since the edge does not become exposed) that the 
only way to lose this sibling is if the sibling dies again, which it cannot do before making at least po 
switches, becoming a strong sibling in the meantime. From this, it is easy to see that yi can have 
a left sibling only during a time interval adjacent to either Vi or Wi+i. Since the sibling stays weak 
these time intervals must be at most 2T* long. 

Due to the above observation, since yi is a left exposed edge after its first complete work period 
following Vi, either Purge or Decay will kill it within the following pi steps. In 7 repetitions of this, 



a gap of width 7B will be created. During this time, by (13.2), at most one growth step can occur 



on the left, leaving still a gap of size 6B. □ 

Lemma 14.2 (Bad Gap Inference). Let cq — (a;o,to) be a left exposed edge, 

D = [xo~8B,xo + B], 

/=[to-(split.t) + r.,to] 

and let Zq be the starting cell of the colony C of {xQ,tQ). Suppose that if Cq is an outer cell then 
D r\{[zQ — 2B, Zq + QB—\) X I is damage-free, else D x I is damage-free. Then one of the following 
holds: 

(1) There is a bad gap on the left of{xo,to) inside the colony o/cq; 

(2) There is a bad gap on the left of (xo, to), and a backward path {xi,ti)o<i<n of length < l{pi -\- 1) 
with the property that one of the path cells, Xi, is in the left neighbor colony. It is a weak left 
exposed edge during [ti-\-,ti-i—]. Also, either it is an expanding germ cell closer than cq to the 
origin of expansion or it is closer than cq to the originating colony. 

(3) There is a backward path of length < 7{pi -\- 1) leading from cq to a cell undergoing one of the 
changes listed in Lemma 12. iS^ (Exposing) ; 



(4) There is a backward path of length < 7{pi -\- 1) leading from Cq to a protected left colony endcell, 
just being killed by a left neighbor cell and exposing a right neighbor. 
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The same statement holds if we replace left with right. 

Proof. Let us construct a backward path {xi, ti) made up of horizontal and vertical links such that 
cell Xi is a weak left exposed edge during every nonempty time interval ti_i— ], and the backward 
path never passes into the right neighbor colony. Suppose that {xi, ti) has already been constructed. 

(1) The construction stops in any of the following cases: 

• the path has reached length 7(pi + 1); 

• the path moved 7 steps to the left; 

• (xi,ti) belongs to the left neighbor colony; 

• {xi,ti—) is not a weak left exposed edge; 

(2) If Xi has a switching time t' immediately before ti such that {xi ,t) is a weak left exposed edge 
during [t',t,] then let (xt+i^t^+i) = {xi,t'). 

(3) Otherwise, let t' be the lower bound of times t such that {xi,t) is a weak left exposed edge. If 
t' < ti then let {xi+i,t^+i) = {xi,t'). 

(4) Assume now that t' = ti. Then at time i^, cell Xi became a weak left exposed edge without 
taking an action: hence, it must have lost a strong left sibling. (Animation does not produce 
any exposed edge, see Condition I2.1Cl| (Animation).) According to Lemma 12.13| (Glue), this 



can only happen in one of the ways listed there. It is easy to check that of these, only the killing 
of the strong left sibling produces an exposed edge in Xi. Each of the rules Jfeai, Decay and 
Arbitrate that could have killed Xi — B presupposes that this cell did not have a left sibling at 
the observation time t" oi Xi — B; thus, it did not have a strong left sibling at time ti~. On 
the other hand, as a strong left sibling of (xi,ti~), it has been alive for at least 2T* hours. Let 
(xi+i, ti+i) = {xi — B,ti). Such an i will be called a right jump. 
By the construction, each jump is surrounded by vertical links. For each jump i, cells Xi and x^+i are 
strong siblings at time ti — . Let {xi^.,ti^) for ii < • • • < im be the endpoints of vertical links on the 
backward path with the property that (jfc, i^ + 1) is not a vertical link; let (x„, i„) be the last point 
of the path. Let us number these points forward in time: (yoi^o) — (x„,t„), (yi,ui) = {xi^,ti^), 
(2/2, 1^2) = 7 ii„_i), etc., creating a left boundary path. 



If the path has moved 7 cells to the left then we are done. If vq <to — 7ti then Lemma 14.1 (Bad 
Gap Opening) is apphcable, and it shows that there is a bad gap on the left of {xo,to). If the path 
has stopped for some other reason then we have the following cases. 

(1) {xn,tn) is a weak left exposed edge belonging to a neighbor colony. 

(2) {xn,tn—) is not a weak left exposed edge. 

In case |l|, the death of (x„,t„) must have created the weak left exposed cell (a;„_i,t„_i) that was 
a left colony endcell. This is possible only if either {xn,tn) is an expanding germ cell closer than cq 
to the origin of expansion or it is some other kind of cell in the originating colony of cq. Continue 
the construction of the backward path for another 7{pi + 1) steps (it is easy to see that now it will 
not be stopped earlier) and apply the Bad Gap Opening Lemma, showing that the present lemma 
is true for this case. 



In case y we have either one of the cases in Lemma 12.15 (Exposing), or (x„,t„— ) dies at time 



tn as a left, non-exposed edge. □ 

The following lemma shows that a colony always makes some kind of progress in the absence of 
damage. 

Lemma 14.3 (Small Progress). Let C be a colony with starting cell zq and to <ti. If it is covered 
by a domain whose originating colony it is at time t then let E{t) be the maximal such domain. 
Suppose that no damage rectangle intersects [zq — 8B, zq + {Q + 7)B] during [to — (split_t), <i] and 
E{t) exists during this time. 
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Then one of the following statements holds: 

(1) The minimum value of Age in E(t) increases at least {ti — to — 2(split_t))/ri — 3 steps during 
[to-,ti—] (the minimum must be defined "locally", taking into account the possible crossing of 
work period boundary). 

(2) an exposed edge appears in E{t') at some time t' £ [to,ii] otnd moves towards decreasing E{t) 
either until it reaches the end of C or at least by B(ti — t')/Ti — B; 

Proof. If both endcells of E{t) are protected during [t p, ti] then the only thing preventing the increase 
of age in a protected ed ge, a ccording to Condition |ll.4| (Address and Age), is when x is frozen. 



According to Condition 12. 3| (Freeze), this x can become frozen only when it has a non-dying 



partner. But Lemma 14.2 (Bad Gap Inference) would be applicable to this partner as an edge 
turned toward x and it is easy to verify that in the present case, this lemma implies an impossibly 
large gap between the partners. Therefore the minimum age of E{t) increases every ti steps. 
Suppose now, without loss of generality, that the right end of E{t) is exposed at some time t' G [<0j ti\. 



Lemma 14.2 (_Bad Gap Inference) implies the existence of a bad gap on the right of Eito) unless one 
of the cases (||) or (^ occurs. If none of these cases occurs then Lemma 13.5 (Running Gap) will 
widen the gap as predicted in (||). 

1. Suppose that case (|^) of the Bad Gap Inference Lemma occurs. 

Thus, there is a backward path of length < 7(pi + 1) leading from Cq to a protected right colony 
endcell x, just being killed by a right neighbor cell and exposing a left neighbor. There are only 
two ways this can happen. One: when x is the endcell of a growth and is killed by the end-healing 
of member cells of another colony. This case does not really occur. Indeed, let y be the left 
exposed cell trying to do the end-healing. The Bad Gap Inference lemma can be applied to the 
y and would imply now, when none of the distracting cases applies, the existence of a large gap 
between x and y. 

The other case is when x is the endcell of a germ. Then the exposed edge thus created will never 
become protected, and the Bad Gap Opening lemma implies the creation of a bad and growing 
gap within (split_t) hours. 

2. Suppose that case ( 0) of t he Bad Gap Inference Lemma occurs. 

The cases in Lemma |1 2. 15 fall into two categories. In all cases but (|) and (|), Lemma |l4l] (Bad 



Gap Opening) applies to the development forward in time from this event, showing that again, 
the exposed right edge moves left, creating an unhealable gap. 

In case (^), a non-end growth cell at Age = (grow_end) becomes exposed. The edge may later 
disappear by healing, but only by end-healing. Indeed, for internal healing a close partner would 
be needed but the Bad Gap Inference Lemma would show (without the distracting other cases, 
this time) that the partner cannot be close. In order to prevent a bad gap from opening, the 
end-healing must succeed within (split_t) hours. Therefore this kind of exposed edge will exist at 
most for a time interval of length (split_t) on the right and for a similar interval on the left. The 
same reasoning applies to case (^. 
We found that in three intervals outside the possibly two exception intervals, the minimum age of 
E{t) increases every ri steps. □ 

14.2. Attribution. 

Lemma 14.4 (Cover). Let Pi be a backward path contained in colony C, starting at time to, and 
passing through the nonempty time interval I ~ [vi,to — Qti] . There is a union Ai of three intervals 
of size (split_t) such that we have one of the following cases, with co = {Pi{to),to). 

(1) At all times vo in I \ Ai, the cell Pi(wo) is in a domain that has no exposed edges in C. 
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(2) There is a time vq in I \ Ai at which Pi can be redirected using at most 2Q links, to reach the 
originating colony ofca; 

Proof. Let be the set of elements t oi I such that a damage rectangle intersects C during [t — 
(split_t) + T,,t+ (split_t)]. Then Aq is covered by an interval of size 2(split_t). At time uo in / \ Aq, 
assume the domain of Pi{vo) has exposed edges in C. If the colony is not an originating colony 
we can choose the edge to look towards the originating colony. Indeed, if there is no exposed edge 
pointing in this direction then we can redirect the path by a horizontal stretch to the originating 
colony, as in case (||) of the lemma. 

Without loss of generality, assume that our edge is a left edge t/o- The conditions of the 
Lemma 14.2 (Bad Gap Inference) are satisfied for {yQ,V()) as the cell (xo,to) in that lemma. Case (|l|) 
of the conclusion of the lemma does not happen since it would satisfy the conditions of the Running 
Gap GoroUary for (yg, vq): hence tg — vo < Qti would follow, contrary to the definition of /. Gase (^) 
implies case of the present lemma and it is also easy to see that then co is not a member cell. 

As in the proof of Lemma 14. 3| (Small Progress), we can show that case (1) of the Bad Gap 



Inference Lemma does not occur. Gonsider case (|3|) of that lemma. As in the p roof o f the Small 
Progress Lemma, we can conclude that cases other than (^) and (^) of Lemma |l2.15| (Exposing) 
do not occur. These remaining cases result again, in at most one additional time interval J f size 
(split_t) to be excluded (not two, since the event in question can occur at only one end of the colony 
C). Thus, let Ai=AoUJ. □ 

If the Gover Lemma is applied with tg — wi > 2Qti then the set I \ Ai is not empty: indeed, 
see ( pJ^ . 

Remark 14.5. In case cq is a germ cell with Age < (germ_end) — 2Q, further / is defined as / = 
[wi,<o ~ SQti] and then we can conclude similarly that there is no weak left exposed edge even 
outside C. Indeed, then the domain in question is of size < 5Q and the Lemma 13. 5| (Running Gap) 



would erase it in 5Qti hours. {) 

Lemma 14.6 (Gover Ancestor). Suppose that Cx[vo, vi] is not intersected by any damage rectangle, 
and at time vi , colony C is covered by a domain with no exposed edges, consisting of member cells or 



of internal germ cells older than (germ_grow_end) . Then, defining Aq as in the proof of Lemma 14-. 4 
(Cover), at all times u in [vo,vi\ \ Aq, the colony is covered by a domain that has no exposed 



Proof. We can follow the proof of the Gover Lemma for the special case considered here: that the 
domain is only over C, consisting of member cells or of internal germ cells. It is easy to verify that 
applicable cases of the Bad Gap Inference Lemma remain that lead to a widening gap. □ 

We will say that cell (xo,to) is attributed to colony C if there is a path Pi going back to time 
to — 5Qti and a union E of at most 3 intervals of length (split_t) such that Pi{t) e C for t in 

(14.1) /= [io-4Qri,fo-3gTi], 



and C is covered by a domain without exposed edges for all times in I \ E. In view of (13.4), this is 
the majority of times in I and therefore if (xq, to) is attributed to colony Cq and {xi , to) is attributed 
to colony Ci then Cq and Ci either are disjoint or are the same. 

Lemma 14.7 (Attribution). Assume that the live cell cq = {xo,to) with colony C is not a germ and 
is not in the wake of a damage rectangle. Then we have: 

(a) It Co is a member cell then it can be attributed to its own colony. 

(b) // Co is an outer cell then it can be attributed to its originating colony. 
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(c) // cq is an outer cell older than (grow_end) + 2QXpi and not adjacent to the originating colony, 
then it can be attributed to its own colony. 



Proof. Without loss of generality, assume that cq is not to the left of the center of its originating 
colony. 

Let us build the path Pi = (co,ci,...) with q = (xi,ti) backward in such a way that vertical, 
parental and horizontal links are chosen in this order of preference. Horizontal links are chosen only 



in the case of the application of Lemma 13.3 (Ancestor). Whenever we have a choice between father 
and mother we choose the parent towards the center of the originating colony of cq or towards the 
center of the current colony, as the construction requries. 

1. Assume that cq is not an outer cell younger than (grow_end) + 2QXpi and one of the following 
holds: 

(1) Pi does not leave C during [to — 2QTi,to]; 

(2) Co is not adjacent to its originating colony; 

(3) [to — (split_t), to] is damage-free; 

Then it can be attributed to its own colony. 

Proof . Let us direct the path always towards the center of the current colony. If the path did 
not leave C during [to — 2QTi,to] one finds, applying the Cover Lemma, some time Vo during this 
period when the colony is covered by a single domain. At time vq, we can extend the path along 
horizontal links to the center of the colony C of co. Continuing backward from there by the original 
method, the path will never reach the edges of the colony, since the only way for it to move away 
from the center is by one horizontal link near a damage rectangle, and this can only happen once. 
Hence an application of the Cover Lemma finishes the proof. 

If Co is not adjacent to its originating colony then the backward path constructed by our method 
never leaves C during [io — 2Qti, to]. Indeed, there is at most one horizontal link in the path during 
this time, and the parental links do not move the path away from the center. 

If neither of the above conditions holds but [to — (split_t), to] is damage- free then co is an endcell 
of C during this time. It must have a sibling in C sometime during this interval since otherwise it 
would be killed by Decay. We can redirect the path into the sibling and continue from there. 

2. If Co is an outer cell then Pi can be redirected to the originating colony during [to — 2Qti, to] and 
then continued there without leaving it again, as above. 

Proof. Without loss of generality, suppose that co is a right outer cell. The Cover Lemma implies 
that for J = [to — 2QTi,to — Qti], at all times vo in J \ A\ with A\ as defined there, the cell 
Pi(wo) is in a domain that has no exposed edges in C At any such time vo the leftmost cell of C 
belongs to the domain since otherwise the left edge of the domain would be exposed. If it has a 
sibling in C — QB then let us then direct P\ to this sibling. According to |], P\ can be continued 
from here without having to leave C — QB again. 

If the left end has no left sibling then it must be already a member cell in a new work period. 



Lemma |14.2| shows that during J \ A\ there are no c ells to the right end of the domain that 
could prev ent t he increase of age as in Condition 11. 4| (Address and Age). Therefore according 



to Lemma 14.3 (Small Progress), the minimum age of cells in the domain keeps decreasing along 
the backward path. But when it decreases by more than Q the left edge of the colony is also an 
outer cell and has a left sibling in C — QB to which the path can be redirected. 
, Assume that Co is a member cell, the interval [to — (split_time), to] is not damage-free and P\ 
leaves C during this time. Then co can still be attributed to its own colony. 



118 



PETER GACS 



Proof. It follows from the cases already considered in[|, that we can assume that the path leaves 
C on the left and cq is the left endcell of C, that before leaving C the path is vertical and the cells 
on it do not have any right sibling. The path leaves C along a horizontal link (cp-_i,Cp). Indeed, 
suppose it is a parental link. The rule animating Xp^i could not be the growth rule since co is a 



member cell and due to (13. IC), the path is not long enough to get the age from the expansion 



period to the end of the work period. The healing rule does not act across colony boundaries. 



The horizontal link across the colony boundary is created as in the proof of the Lemma 13.3 
(Ancestor), therefore the extended damage rectangle is near the colony boundary, near time tp. 

If Cp is an outer cell then ^ implies that the path can be redirected within the desired time to C. 
Let us show that Cp is not member cell. It could be one only if Pi crosses into C — QB at an age 
when C is a newly created colony covered with outer cells and Cp is in the originating colony. Let 
i < phe such that Ci_i is still a member cell but Ci is already an outer cell. Then the transition 
from Ci to Ci-i involves a transition from outer to member status at the end of the work period. 
This cannnot happen, however, under our assumption that Ci has no right sibling. 

□ 

14.3. Progress. The lemma below says that under certain conditions, the age of a colony increases. 

Lemma 14.8 (Large Progress). Let Ci — (xi,Wi) he of colony C with starting cell Zq, within B of 
the center of C and not in the wake of any damage rectangle. Assume that either Ci is a member or 
it is an internal germ cell with Age > (germ_grow_end) + lOQXpi. There is a vq and a forward path 
Pi in C X [wo,t'i] such that defining 

(14.2) Xi = Pi{vi), Ci = (xi,Wi), 

(14.3) A = Age(ci) - Age(co), 

we have vi — vq < 5Qti , further either Q < A or the path crosses a work period boundary. Cell cq 
is within distance B of the center of C, the latter is not in the wake of any damage rectangle, and C 
is covered by a domain at time vq. 



Proof. The proof is a repeated application of Lemma 14.3 (Small Progress), with some case distinc- 
tions and delays due to damage. Let us build the path Pi backward like in the proof of Lemma 14.7 
(Attribution), tiU 

vo ^ vi - 3Qti. 

This path does not leave C. Indeed, it could only leave if there was a number of steps in which the 
path moves towards the edge of the colony. Each such step but one must be a parental link. If the 
animation was via healing then the healing would be an internal correction and the parent towards 
the center would be chosen, so this is excluded in all cases but one. The animation cannot be via 
growth since due to ( 13.10| ), the age of cells along the path Pi cannot decrease enough for them to 



become growth cells. 

Let E[t) be the maximal domain containing Pi{t), and let m{t) be the "local" minimum value of 
age in E{t) (taking into account the possible crossing of work period boundary). Let I = [vo,vi — 
Qti]. According to the Cover Lemma, there is a set Ai coverable by 3 intervals of size (split.t) such 
that for all t in / \ Ai the colony C is covered by E{t). If vq G Ai then let us change vq to a little 
larger value, so that vq ^ Ai. Then Pi can be led back to the center at time vq. It remains to show 
A > 2Q. 

Let us represent J \ Ai as a union of at mo st 4 disjoint closed intervals [si, fi] {i — 1,2,...). In 
each interval [si, fi], according to Lemma 14. 3| (Small Progress), m(t) keeps increasing to 



(t-Si- 2(split_t))/Ti - 3 
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until an exposed edge (say, a right edge) occurs at some time ti. By the same lemma, for t £ [ti, 
this edge moves at least {t — ti)/Ti — 1 cell widths till time t towards C unless it disappears earlier 
by reaching C. The damage can stop this process and move right the end of E(t) by at most 2 
cells. Therefore except for a union Ji of two time intervals of a total length of {Q + 2)ti accounting 
for right exposed edges and another exception set J2 accounting for left ones, part |^ of the Small 
Progress Lemma is applicable. 

The set / \ (^1 U Ji U J2) breaks up into a disjoint union of at most 6 intervals [s^, //]. The 
domain E(t) covers C and has no exposed edges during these intervals, so m(<) increases in them as 
in the Small Progress Lemma, and the total increase during / is at least 

(14.4) {vi -vo- {3Q + 4)ri - 7(split_t))/ri - 14 > Q. 

□ 



15. Healing 

15.1. Healing a gap. 

t' > t a {x',t') is reachable by a forward path from some {x,t) for some x in E. 

Lemma 15.1 (Healing). Let [ao,ai] x [uo+,ui— ] be an extended damage rectangle. Let E be a 
domain at time uq with the property that for each locally maintained field F such that the age of no 
cell of C is in an update interval for F (let us call these fields "relevant" ) , the latter is constant over 
C at this time. Let C — {zt^ + iB : i = Q, . . . ,(5 — 1} be a colony with which the addresses in E are 
aligned, with oq G [zo,zo + QB/2—]. Then the following holds. 

(a) At time wq = uq + 4ri, some multidomain in [ao — QB/2, ao + QB/2~] contains all successors 
of E n [ao — QB /A, ao + Q-B/4— ] from time uq; 

(b) // E covers C at time uq and all cells of C are member cells then a domain covers C at time wq 
and each relevant locally maintained field is constant over C at this time; 

Proof. 

1. Let us prove (^). 

The damage can change only a single cell x of E. If this cell is at an end of E then there is 
nothing to prove. Otherwise, x — B and x + B are also in E. Assume first that both x — B and 
x + B survive until wo- Then we have a case of internal correction; let us show that this correction 
succeeds. If an internal correction of a; — 5 is also possible before internal correction starts in 
x (it cannot become possible after the correction of x started because either x was dead or the 
correction of x starts with killing x) then the reasoning for x can be applied to x — B. Suppose 
therefore that internal correction of a; — i? is not possible. The correction begins by killing x (in 
case it is not mild). Let us show that it succeeds by reanimating it. If a: — B is not a left edge 
any time before Wq then the internal correction of x will succeed since the animation, needing a 
neighbor that is not isolated, can use x — B. Similarly ii x + B is not a right edge any time before 

Wq. 

Let us show that if each of these cells is an edge pointing away from x then one of them dies 
before wq contrary to our assumption. Certainly, one of them is an exposed edge, suppose that 



e.g. X — B is a, left exposed edge at some time t in [uq, wa — ti]. Lemma |14.2 (Bad Gap Inference) 



implies that either there is a bad gap on the left of a; — i? or there is a backward path of length 
< 7{pi + 1) leading from {x — B,t) to a cell in the same colony that changes, at age (grow_end), 
from a protected to an exposed left edge, or undergoes a planned kill. If there is a bad gap of 
the indicated size then a; — -B is an isolated cell during [t, wq] . It is easy to see that it is not an 
endcell of a near-end correction and therefore it will be purged by the time wq, contrary to the 
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assumption that it stays alive. Suppose now that a backward path of length < 7(pi + 1) leads 
from {x — B,t) to a cell in the same colony that changes, at age (grow_cnd), from a protected to 
an exposed left edge. Then it is easy to see that x + B (which also is a right edge at some time 
before wq) is an exposed cell for which the Bad Gap Inference Lemma really infers a bad gap on 
its right, resulting in its purge. 

Suppose now that e.g. x — B dies before wq. If there will be no cells after wq that can be traced 
back to E only bey tracing them through the left or x, then we are done. If there are at any 
time t > uq, then they must be connected by parental and horizontal links to x — B, therefore 
they form a domain from x — B left. This domain can only be killed from the edges, x — B 
cannot be the last element to be killed since then the whole domain disappears by wq. Therefore 
there must be moment t' when x — B and x — 2B are alive and x — B will be killed. The only 
rule accomplishing this is Decay, if a; — i? is a right exposed edge and the gap in x cannot be 
healed. Since a; + -B is still there this could only happen if no internal correction is possible in 
X, since otherwise this internal correction would have succeeded already before the killing. The 
only obstacle to the internal correction is if x + B has no right sibling and is not a right protected 
endcell. 



It is easy to see using the Lemma 14.2| (Bad Gap Inference) that there is no domain on the 



right of E with which the successor of E could merge in the given time period. Also healing the 
gap caused by the death of a; — i? is not possible since (it was created due to the impossibility of 
healing to begin with) . Therefore the the whole domain containing x + B (and possibly containing 
X and even x + 2B if the latter arose by growth in the meantime) decays within further 3ti hours. 
, Let us prove (^ now. 

Suppose that E covers C at time Ug and all cells of C are member cells. Let x be the cell changed 
by the damage. If it is not an endcell or next to an endcell of C then (given the absence of planned 
kill) the change caused by the damage will clearly be corrected by the rule Heal. 

Suppose that x — -B is a left endcell. Then the only obstacle to healing x is if there is a competing 
internal correction in x — B that would create a member cell in a; — i?. This cell could only belong 
to a colony C different from C since a; — _B is an endcell in C. Since there is a correction in a; — i? 
the cell X — 2B must belong to C It is clearly exposed to the right at the observation time of the 
correction (since the correction would create a member cell, a protected edge is exclud ed). It must 
have been there as an exposed cell for a significant time. Indeed, according to Lemma 14. 6| (Cover 



Ancestor), C has been full for a long time, the growth could not have created x — 2B during quite 
a long interval before uq, and healing could not have created it while x — B was an endcell of C. 
But it could not have been there long since the Decay rule or the Purge rule would have killed it. 

Suppose now that a; is a left endcell. Then it is part of an end-correction. According to the 
healing rule, one possible obstacle to carrying out the end-correction is that there is an internal 
correction in a; -|- B. In this case, x is the endcell of a near-end correction, therefore it will not be 
purged, and the near-end correction will be carried out. All other possible obstacles listed in the 
healing rule imply that x — B has been right exposed cell for such a long time before uq that it 
would have died. 

If the healing rule tries to correct x it still must be proved that it succeeds in doing so. If x is 
not latent then first x will be made latent, then line (^) of the rule Heal in Subsection 12. 5| applies 



to cell x+B to make a member cell in x by setting Kind-i.Heal := Member and triggering thereby 



rule Animate of Subsection 12.4. We must show that the animation of the latent x succeeds. The 



only obstacle to the animation would be an end-healing of member cells of some other colony C 
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from the left. But this cannot happen since then the rightmost Hve cell of C must have been 
exposed before the damage, and Lemma 14. 2| (Bad Gap Inference) would imply that it has a large 
gap on its right, excluding right end-healing. 



If X is vacant then first it must be made latent by rule Create of Subsection 12.3: for this 



Lemma 12.8 (Creation) can be used, 
healing. 



Again, no obstacle will be posed by any opposite end- 



Adding the time upper bound in the Creation Lemma to tq, we get the upper bound 4ri for 



the healing time. Due to to (13.1), this is still smaller than the time needed for the decay rule to 
kill any cell made exposed by the damage. 

□ 
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16. Computation and legalization 
16.1. Coding and decoding. 

16.1.1. Main track and redundancy track. Assume that the transition function Tik and has band- 



width w = Wk as. defined in 2.2.1, and similary the simulated transition function Tr^+i has band- 
width Wk+l- 

The state of each small cell is partitioned using Buf ~ Inbuf U Outbuf, etc. Also, the field Info 
is partitioned into subfields Info. Main and Info.Redun. Each of the fields of a small cell mentioned 
in the construction of the previous sections, with the exception of Info. Main, will be part of Buf. 
Let 

P = Capk^j^/wk+i- 

Assume w.l.o.g. that this number is integer. We partition the state of the represented cell into 
P fields called packets of size Wk+i. We also break up the field Info. Main into fields Info.Maini, 
i — 0, . . . , P — 1 oi equal size. Assume without loss of generality that 

(16 1) C'^Pfc < C'aPfc+i < g C'apfc 

Wk ~ Wk+l ~ Wk 

The first inequality implies that the width of track Info.Maini is less than the bandwidth of a small 



cell. The second inequality will be used in deriving (16.7). (In case the first inequality was violated, 
we could distribute a packet over several tracks whose width is the bandwidth, and otherwise the 
algorithm would be similar. In case the second inequality was violated, we could put several packets 
onto each tracks whose width is the bandwidth, and otherwise the algorithm would be similar.) 

Let us also break up the track Info.Redun into locations (segments) dnf o.Redunj, z = 0, 1, . . . , P— 
1. Our error-correcting code will distinguish, similarly to Example |4.§| , information symbols and 
error-check symbols. Each packet of the represented state will be encoded separately. Track 
Info.Maini will contain the information symbols, and location _Info.Reduni contains the error-check 
bits of packet i. Let the location 

Jnfoi 

denote the union of track Info.Maini and location dnfo.Reduni. These together form the error- 
correcting code of packet i. The first three packets represent Inbuf*, Outbuf* and Pointer*. 
We will work with a 12-error-correcting code 

(16.2) a 



as in Example 4.6, with Qk information symbols. Since that code can correct errors in t symbols 
with just 2t error-check symbols, so for each i, the location _Info.Reduni is only 24 cells long. When 
ioc is a location representing a string s then we will use the notation 

a*{loc)=a*{s) 

for the value decoded from it. 

The computation result will be stored temporarily on the track Hold. This has, similarly to 
Info, also subfields Main and Redun. This computation result corresponds to the output of Tr"''°+\ 
Recall that the latter consists of 5 segments of size Wk+i . The first three segments determine the 
new values of Inbuf*, Outbuf* and Pointer*. The fourth segment determines the new value of a 
part of the cell state whose position is determined by the fifth segment. The output of the simulating 
computation, on the Hold track, will correspond to the first four of these values, in four subfields. 
The fifth one will be kept in a locally maintained field called 

Target_addr 
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Info.Maino 
Info.Maiiii 



_Info.Rcduni 



_Hold.Redun2 



.Hold.Redun„ 



Info.Mainp^i 

Info.Redun 

Hold.Maino 



Hold.Main3 
Hold.Redun 
Target^addr = n 



Figure 17. The Info and Hold tracks] subdivided into packets, with P = 6 and 
n = 5. The location ^Infoi is the union of the framed locations Info.Maini and 
.Info.Reduui. 



which is supposed to have a constant value throughout the colony. Track Hold.Redun is split into 
locations JJold.Reduni for i = 0, . . . , P — 1, most of which will be unused. For i = 0, 1, 2 we define 
again Jioldi as the union of the track Hold.Maini and _Hold.Reduni, similarly to how _Infoi was 
defined. The redundancy bits for Hold.Main^ will be put, however, not into ^Hold.Redun^ but into 
-Hold.Redunn where n = Target_addr (here, n has the same meaning as in 2.2.1). Correspondingly, 
we define 

Jlolds = Hold. Mains U JIold.Redunn. 



16.1.2. The code of the amplifier. Recall (13.13). If ip*{r/){y,t) ^ Bad then let us define 

r]*{y,t) = ip*{y,t) = Vac 

unless there is a time vi in [t — (crit),t] such that the colony C{y) is covered with member cells 
belonging to the same work period at time vi and is not intersected by the extended damage 
rectangle at this time. In case there is such a time vi let t* be the last time before t with this 
property that is also the switching time of a small cell (just to make sure there is such a last time). 
Let 

s^T]{Ciy),t*).Info 



(the meaning of this notation should be clear). If s contains at most 2 errors then (see (16.2)) 

else it is Vac. The look-back to an earlier time in the definition avoids some oscillation between the 
different states assigned to the large cell that could occur as the result of damage and asynchronous 
updating. 
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The encoding takes the state of a ceh, appUes a* and copies the resuh to Info track of a colony. 
The other parts of the colony are set in such a way as they are to be found at the beginning of a 
work period. The encoding (^*, is like except that all tracks but Info will be filled with the don't 
care symbol *. 

16.1.3. Coding and decoding rules. Let 

Vacant^str 

be a string representing a vacant state in our code. Let 
(16.3) Codej3ize{k) = \Ro/Ri{k)] 



where Rq and Ri{k) were defined in |9.1.2| and [9.1.3 . Let Decode-prog be a program for the medium 



Univ such that location JDecode_output on track Cpt. Output after applying 

Univ{Decode_prog U S; Q, 2Q) 



contains a*{S) (see (16.2)) if a^,{a* {S)) differs from S in at most 12 symbols: otherwise, it contains 



Vacant^tr. The decoding rule takes a string from location 7oci, decodes it and copies the result to 



location loc2- The rule Write was described in Subsection 7.4 

rule Decode{loci,loc2) { 
Cpt. Input :— *; 
Write{Decode.prog, -Prog) 
Copy{0, loci , _Decode-arg) ; 
apply the rule Univ 2Q times to Cpt 
Copy{0, _Decode_output, I0C2) ; 
} 



(Recall the assumption made in 7.2.1 that Univ is commutative, so we do not have to worry about 



the order of execution in different cells.) The rule 

Encode{loci , I0C2) 

performs encoding. The locations l ocj in the encoding and decoding rules will be allowed to be given 



indirectly, as explained in Remark 7.1C . 



16.2. Refreshing. In each rule of the present section, the operation 

2 

Maj Sj 

i=0 

when applied to strings sq, Si, S2 is the result of taking the bitwise majority of these three strings. 
We will have a general type of rule 

Checkoiprop,FiI),Xi,X2,...) 

checking some global property of some parameters Xi,X2, . . . each of which is either an explicit 
string or a location. Here, prop is some program for the universal computing medium such that 
after we run the rule of that medium for Q steps the bit b representing the outcome of the check 
will be broadcast into field F of every cell of interval /. We will also need an error-checked version 
of CheckQ which uses some fields Votei for i = 0, 2, 3: 

rule Checki {prop, F{I), Xi, X2, . . .) { 
for i e {0, 1,2} do { 
( 1 ) Checko {prop, Vote,{I), X^, X2, . . .) } 

cond { 

? Addr e I 
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! F := Maj^^o Votei 
}} 

The rule 

Checko{nearly.cqual, F(/), loci,loc2, d) 

checks whether the strings in locations loci and I0C2 differ in at most d code symbols. The subrule 
Refresh attempts to clean the information on the Info and Hold tracks by decoding and rc-cncoding 
it. It also attempts to make sure that these tracks represent at least something, even if this is only 
the vacant state. 

Rule UpdateJoc-maint updates the locally maintained fields 

Doomed, Growing^ (j £ {—1, !})• 

(It need not update the fields Controlo,ij to be introduced later.) Rule 

(16.4) Checkoirecheck_l,F{I), G,d) 

checks whether track G is covered with I's with possibly d cells as exceptions. As usual, the argument 
F means F([0, Q — 1]). The rule uses some additional locations and fields: _Decodedo .Encodedn for 
n = 0, 1, 2, Ck_reSn for n = 0, 1, 2. 
subrule Refresh { 
Ck_res := 1 ; 

for i = to P - 1 do { 
for n = to 2 do { 

Decode{-Infoi, -Decodedo) ; 

Encode (^Dccodeda, _Encodedn) ; 

Checko{nearIy.equal, Ck_reSn, -Encodedn, -Infoi, 8) ; 

Idle Q steps } 
J^ncoded := Maj^^Q -Encodedn ; 
Ck_res := Ck_res A Maj^^Q Ck_reSn ; 
Copy{_Encoded, -Infoi) ; 
Idle Q steps } 
( 1 ) Checki {recheckA , Ck_res, Ck_res, 2) ; 
cond { 

? Ck_res = 

! Wri te{ Vacan tstr , Jnfo ) } 
Repeat the whole procedure with _Hold in place of Jnfo, "mutatus mutandis" . 
} 

Let us say that a string s of symbols that can be the argument of a certain error-correcting 
decoding has d errors if, when applying the decoding and encoding to it, the result differs from s in 
d symbols. 

Lemma 16.1 (Refresh). Assume that we have a full colony in which all cells have ages before the 

starting point of Refresh. Then we have: 

(a) At the end of Refresh, for all i, locations dnfoi, have at most 4 errors; 

(b) Suppose that at the beginning of Refresh, for all i, locations Jnfoi, have at most 4 errors. Then 
during the whole Refresh, the value a*{-Infoi) does not change and -Infoi has at most 8 errors. 

The same statements hold for location ..Hold. 

Proof. We perform the analysis only for Anfo. 

1. For any i, the "immediate" effect of the damage can affect at most 4 symbols of Jnfoi. 
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Proof. The damage rectangle can have immediate effect in the cell where it happens, and its 
healing can change possibly one more cell next to it. If the affected cells happens to contain the 
symbols from Jieduni then this can affect two information symbols and two error-check symbols. 

2. Assume that the damage rectangle occurs before point (|l|) of the rule. 

Let us look at the damage-free operations in Checki after (|l|). The first such operation turns 
Ck_res into all or all 1. If Chores = then the last step writes Vacantstr into -Info, and thus 
the latter will have no errors. 

Suppose therefore that if Chores ~ 1. Then before Checki, we have Ck_res = 1 in all but two 
cells. Let us look at any cell x not affected immediately by damage (the large majority is such). 
Since the majority step computing Ck^res is correct in x, for all i there must have been two values 
of n for which Checko wrote the value 1 into Ck_reSn{x). For at least one of these n, say ni, the 
computing period {i,ni) is damage-free. That period wrote an error-free string into _-Encoded„^ , 
and also determined that this string does not differ from _InfOi by more than 8 symbols. There is 
another damage- free period (i, ?^2)■ Its Jnfoi could have been different at its beginning (if damage 
occurred between the two periods), though according to |l| by not more than 4 symbols. Therefore, 
since our code corrects 12 errors, we have ^Encodedn^ = _Eiicodedn^ . Except for the immediate 
effect of new damage, this is going to be the new value of -Infoi, therefore there can be at most 
these 4 new errors at the end of Refresh. 

Suppose that at the beginning, each dnfoi has at most 4 errors. By 0, the damage rectangle 
affects at most 4 cells of dnfoi immediately. Therefore during the whole Refresh, the value 
a*{-Infoi) does not change and dnfoi has at most 8 errors. 

3. Assume that the damage rectangle occurs on line (|l|). 

Then the damage did not occur before and therefore the track Checkjres contains either all O's or 
aU I's. 

Suppose the damage has occurred before line (^ of rule Checki, in iteration n of that rule. 
Then only Voten{I) will be affected, and possibly two more cells, the cell where the damage 
occurred and a cell next to it due to healing. In the 2 other iterations, the track Check_res still 
contains either all O's or all I's with these exceptions, and the result of these 2 iterations is the 
same as it was without the damage in the third iteration. The majority vote brings out the same 
result, except for possibly 2 immediately affected cells. 

If the damage occurs after line in rule Checki, i.e. in the last step of the rule, then again only 

the cells immediately affected will not have the correct result. 

If the damage occurs after line (|l|) in rule Refresh then again, this being a one-step action, only the 
cells immediately affected by the damage would show up as errors. □ 

16.3. Computation rules. After retrieval from neighbor colonies (to be discussed later), the string 
returned from neighbor m will be in some location 

-Retrievedm ■ 

These strings are inputs to the simulated transition function but still in encoded form. After decod- 
ing, the result will appear in _Decodedm 



16.3.1. Evaluation. As seen in (2^), only the field Buf* of the neighbors is found in ^Retrievedr, 



for m ^ 0. For m = 0, the location -InfOa is also needed, where a is determined by Pointer*, 



which according to 16.1.1 is found in _Info2. Thus, first we broadcast this number a into a locally 
maintained field 



Source.addr 
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of each cell of the colony, and then use indirect copying as in Remark 7.10 to copy -Infoa into the 
appropriate part of _Retrieveda. 



Then, the subrule Eval finds the intended output of Tr[^,, as defined in 2.2.1, using the universal 



' fe+i 

computation, as in Decode above. Here are the details. 



We are proving Lemma D.4 (Amplifier), so what is given is an amplifier frame Frame, and the 
number k, and we are building the transition function of the medium Mi~. Assume that Frame is 
described in a string 

Param2 — Frame, 

the parameter k is described in a string 

Par ami — k, 

and the program we are just writing is Paramo. First Eval computes Tr^'^^. It uses several new 



locations, some of which were introduced in 7.4.5 : Jnterpr, _Parami for Param,;, _Arg„j. The result 
will be deposited in ^Sim^output^ for i — 1,2. 

Cpt. Input :— *; Write {Interpr, _Interpr) ; 

Write (Paramo ) -Paramo) ; 
Write (Parami + 1, -Parami) ; 
Write(Param2, _Param2) ; 
f or m 6 NbJnd do { 

Copy{0, _Decoded„i , -Arg^^ ) } 
apply Q times the rule Univ to Cpt 
The result is in _Sim_outputi . 
Copy{0, -Sim^outputi, _Sim_output2) 

As seen in (p.^), in the output string of Tr^^!^^, only the first segment of size Wk+i (corresponding to 
field Inbuf*) depends on the neighbors. To make sure that really this is the case, the above sequence 
of operations will in fact be performed twice, with the following differences. 

• The first time, for all m ^ 0, the string written in _Arg,„ will not be a copy of Jietrievedm but 
simply the corresponding part of yacant_str. Thus, the result does not depend on retrieval 
from neighbor colonies; 

• The second time, only field Inbuf* of ^Sim^outputi will be copied into the appropriate part of 
-Sim_output2; 

According to |9.1.1| , the complete transition function is combined from Sim_trans and Rd-trans. To 
make sure this is the case, we recompute Rd-tranSk+i and perform the combination. Thus, now 
the rule Eval finds out from _Sim_output2 whether the new big cell is going to be vacant. If not 
then Rd_tranSk+i is computed somewhat similarly to Sim_tranSk+i above. The main differ ence 
is in what we copy into Paramo. According to the complexity upper bounds in Subsection ^.l| , 
the sequence Rd-tranSi of functions is uniformly simple. Therefore Frame contains a program 
Rd-transjprog computing Rd-transk+i on the universal medium Univ from fc + 1 and its arguments 
in time i?o|lSfc+i|| and space i?o|lSfc+i||. We will therefore copy RdJ^rans.prog into _Paramo. The 
result will be stored in Jid-0utput2. 



Finally, we combine Sim.out put^ and Jld_output2 as required in 9.1.1. into a location 
.EvaLoutput. In accordance with ( ^.7[ ), this location can be broken up into segments _EvaLoutput^ 
for z = 0, 1, 2, 3, 4, of size Wk+i- Locations -EvaLoutput^ for z = 0, 1, 2 are supposed to be the first 
three packets of the new big cell state. .EvaLoutput^ is also supposed to be the new value, to replace 
some packet Jnfo.Main„ of the new cell state, where n — ^EvaLoutput^. To use this information, 
the number n will be broadcast into a field Target-addr of each cell. 
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Remark 16.2. It seems wasteful to compute Rd_transk+i twice: once as part of Tr^+i and once 
separately. To avoid this, in the first computation we can replace Rd-trans_prog with a trivial 
program that does not do anything. ■(> 

This ends the definition of the rule Eval. 



16.3.2. The computation rule. In the rule Compute, the evaluation process will be repeated 3 times: 
in repetition n, we code .EvaLoutput onto track Hold.voten, and finally Hold will be obtained by 
majority vote from Hold-votCn {n = 0, 1, 2). 

In the rule Randomize, the Rand bit of the first cell of the colony is taken and an a*-code of it is 
placed into the appropriate location on the Hold track. This is done only once, without any attempt 
of error-correction. 

Rule UpdatcJocmaint will be discussed below. 

subrule Compute { 
Refresh ; 

for n = to 2 do { 
for TO 6 NbJnd do { 

Decode{-Retrievedm , -Decodedm) ; } 

Eval; 

for i = 0, 1,2,3 do { 

Encode {-EvaLoutput^, Jioldi) ; 

For 1 = 3, use indirect copying with Target-addr = .EvaLoutput^. } 

Hold-voten = Hold ; 

Target.addr^ := Target.addr ; } 
Hold := Maj^^o Hold_votc„ ; 
Target-addr := Maj^^^ Target-addr^ ; 
Randomize ; 
Refresh ; 

Update Joc-main t 
} 

16.4. Finishing the work period. The present subsection is devoted to achieving good upper and 
lower bounds on the length of the colony work period, i.e. on T*fc+i and T,j,_|_i in terms of T*k and 

T.J.. ThoTigh it seems likely that these bounds can be proven if each cell of the colony simply runs 
until the same maximum age [//j — 1, we did not find such a proof. Therefore we introduce some 
special rules. 

The March rule updates the age of a cell only when its neighbors allow it, and this is what makes 
the time estimates difficult. We will therefore define a different rule, Marchi, which allows updating 
a different field, Age^, independently of any neighbors. The idea is to end the work period when a 
certain cell reaches Agc^ = U, since then the (multiplicative) uncertainty in the length of the colony 
work period will not be much greater than the uncertainty in the length of the individual cell dwell 
period. We will use the Age^ of more than one cell, for robustness, but then we will need to reach a 
consensus among these about when the end comes. To leave clearly enough time for the consensus 
computation, and to make sure the work period ends in a coordinated way, what will be determined 
by consensus is some future value End of Age when the work period will end. 

16.4.1. Finding the work period end. We will also use the auxiliary field 



Endo e[0,U-l]. 



RELIABLE CELLULAR AUTOMATA 



129 



At the beginning of the work period, we start with Agei — 0, End — Endo — U — 1. (These values 
are extremely large for End, Endo.) Recall the definition of (end_period) in (13.10). The field Endo 
is required to have the property 

(16.5) Endo = C/ - 1 or Endo < Age + (end.period) 
which will be enforced by a trivial rule. The update rule of Age^ is trivial. 

rule Marchi { 
cond { 

? AgCi <U -I 

! Agei :=po AgCi +po 
? ! cond { 

? Endo = U -1 

! Endo :=po (U - 1) A {Age + (end_period)) 
}}} 

Thus, each cell updates its Age^ independently of all other cells, and the delays are also counted in. 
When the upper bound J7 — 1 of Age^ is reached each cell puts its estimate of the end of the work 
period into Endo. The subrule Find_end will last at most 

(16.6) (synch_time) — 16Q 

steps and will be run every 4(synch_time) steps after Age > (synch_start Jb) (see ( 13.8| )). It uses 
votes Endi e [0, J7 — 1] {i — 1, 2, 3, 4) to achieve consensus of the Endo values and to store it in End. 
The subrule 

CheckQ{synch_consensus, EndJegal, End4) 
checks whether there is a c such that End4(a;) = c for all but 3 of the cells x in the colony, 
subrule Find-end { 

for i e {1,2,3} do { 
cond { 

( 1 ) ? There is a c < ?7 — 1 such that End — c 
for all but 2 cells in the colony 

! Endi :=pi c 

(2) ? All but g/4 cells have Endo <[/- 1 

! Endi '-—pi the maximum of the Q/2 smallest values 
of Endo in the colony. ; }} ; 

cond { 

? There are two equal values among { Endi : « G {1, 2, 3} } 

! End4 :=pi Maj^i End, } ; 
Checki {synch_consensus, EndJegal, End4) 
cond { 

? EndJegal = 1 

! End := End4 

}} 

16.4.2. Finish. At Age = End for non-germ cells and Age — (germ_end) for germ cells, a final 
computation takes place in the rule Finish. The information from the Hold track will be copied into 
the corresponding locations on the Info track. This is done in accordance with (|]^): locations Jioldi 
for i = 0, 1, 2 go to _Infoi. On the other hand, location .HoJda goes to -Infon where n = Target-addr. 
Still, due to the positioning done earlier, the whole operation involves only copying within each cell 
and takes therefore only a single step. In particular, Outbuf* remains unchanged until the last step. 
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In the same step, addresses of growth cells will be reduced mod Q. Outer cells and inner germ 
cells turn into members, outer germ cells turn latent. End^EndQ, Age, Age^ become 0. 



16.4.3. Updating the locally maintained fields. The rule 

Broadcast{loc, F{I)) 

takes the information found in location Joe and writes it into the F field of every cell in interval /. 
The rule Checko{Check.vacant, F(J), loc) checks whether the string represented in Joe is Vac*. Let 
-Creating j be the known location of the field Creating* of the represented cell inside _Decodedo after 
decoding. The value in this represented field will be broadcast reliably to the track Growing j. 

rule UpdateJoc-maint { 
for n = to 2 do { 

Decode{_Hold, JJecodedo) ; 
Cheeko{Check.vacant, Voten, -Decodedo) } 
Doomed := Maj^^Q Vote„ 
for J G {-1,1} do { 
for n = to 2 do { 

Decode{-In{o, -Decodedo) ; 
Broadeast(_Creating , Vote„) ; } 
Growingj :— Maj„^o ^ote„ 
}} 



Remark 16.3. Defining the rules this way wastes many operations since we do not extract all infor- 
mation from its decoded form at once but rather decode repeatedly. But it is easier to follow the 
analysis in this form. <0> 



Lemma 16.4. Assume that we have a full colony in which all cells have ages before a starting point 
of Update JoC-maint . Then at the end of this rule we have the same conditions as at the end of 
Refresh, and in addition the following: 

(a) For all but at most 2 cells, the locally maintained fields Doomed and Growingj have constant 
values in the colony; 

(b) We have Doomed = 1 almost everywhere iff -Hold represents a vacant state; 

(c) For j S {—1,1}, for almost all cells in the colony. Growing^ is equal to Creating* of the cell 
state represented by the colony; 



Proof. The proof is similar to, only simpler than the proof of Lemma 16.1: simpler, since at the 
starting point of UpdateJoc-maint, we can already assume that the conditions asserted by that 
lemma hold. □ 



By choosing Q sufficiently large {Rq times the bandwidth), the time complexity of each of the 
Univ computations can be bounded by some constant times Q and therefore the total number of 
steps taken by Compute can be estimated as 

(16.7) KoQP - KoQ^^^^ < 2KoQ^ 

Wk+l Wk 



where Kq is some absolute constant, which is consistent with (13.6). 
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16.5. Legality. Let us prove parts ^ and (^) of Condition B.8 (Computation Property) for big 
cells. 

Lemma 16.5 (Legality). Assume that, for some rational number a, Damage* does not intersect 
{x} X [a-T,*/2+,a + 2T**]. 

(a) //ai,(T2 is defined in the interval [a+, a + 2T'*] Then 

(16.8) Jega4_^i(77*(a;,cr2-),?f (x,cr2)) = 1 

and T,* < (72 ~ fi < T'* . 

(b) // ao is defined then ( |l6.8|) holds for ctq in place of a-i . 



Proof. The discussion in Subsection 11.1 has shown that r]*{x,t) ^ Damage* iff the damage can be 



covered by a single damage rectangle in (x, t) + V' where V is defined in ( 11.1 ). The space projection 
of V is at least 1.5QB and the time projection is at least T,*/2. Thus, our assumption implies that 
during every time interval ofsizer,*/2 in [a — T,*/2,a + 2T'*], the interval [x — 0.25QB,x + 1.25QB] 
is intersected by at most one damage rectangle. 

Consider a time V2 such that ?/*(a;, V2) 7^ Vac. If uo is defined then let V2 = ctq, while if 172 is defined 
then let it be some time very close from below to cr2- 



According to 16.1.2| , there is a time vi in [v2 — (crit),D2] such that the colony C is covered with 



member cells belonging to the same work period at time vi and is not intersected by the extended 
damage rectangle at this time. Lemma |14.8| (Large Progress) says that, by setting ui — vi, there 
is a time uq G [ui — 5Qti,ui\, and a path Pi in C x [uo,ui] with di — {Pi{ui),Ui) within distance 
B of the center of C such that C x {uo} is not in the wake of a damage rectangle, is covered by a 
domain, and Q < Age{di) — Age((io). If do is a member cell let us apply the Large Progress Lemma 
(backward) repeatedly until either the path crosses a work period boundary, or the age along the 
path decreases by 

L = 6(synch_time) + (end_period). 
Suppose the cell we arrived at is not a member cell and has Age > End — Q. Then we perform one 



more repetition, but stop along the path as soon as the age decreases below End — Q. Lemma 14.6 
(Cover Ancestor) shows that we can stop at a time when the colony is covered by a domain. Let us 
now denote by do = (j/Oj '^o) the cell at which we arrive this way. 
1. Assume that do is an outer cell. 
Then we have 

(16.9) Age(di) - Age(do) <L + Q 



and at time wq, the whole colony is covered by outer cells. Lemma 14. 7| (Attribution) shows that 



do can be attributed to its own colony and that this colony could not have been occupied by 
member cells for at least (crit) time units before mq. Therefore r]*(x,uo) — Vac. 

In this case we have V2 = cq. The outer cells at time uq encode a latent cell of medium M*. 
Therefore eventually, when all outer cells turn into member cells, the new colony will encode a 



latent cell. Now, a vacant-to-latent transition is legal by Condition 11.3. 
The reasoning is analogous in case dp is a germ cell. Assume now that do is a member cell in the 
same work period as Ci: then Age(di) — Age(do) > L. Then we have the case when cri,(T2 are 
defined. 
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2. The Large Progress Lemma can extend the path repeatedly until we arrive at a cell cq = (xq, vq) 
in the same work period, with 

(16.10) Age(co) < 2Q, 

vi-vo< UT' + 5tiL. 

Proof. One can find a time ti in [uo,t;i] such that denoting ei = (Pi(ti),ti) we have 

Age(ei) < Age(do) + 6(synch_time), 
and there is a complete damage-free execution of Find_end during [Age{do) , Age{ei)]. 
2.1. This execution of Find.end finds more than Q/A cells in C with Endo = U — 1. 

Assume that this is not true. Then this execution of Find_end sets the common value of End to 



a value that was < EndQ{y, t) for one of the cells y at the time t when it examined y. By (16.5), 
then End < Age{y, t) + (end_period). It is easy to see that subsequent executions of Find-end do 
not change this value of End (except for the spot of damage, which will be corrected). Therefore 
the work period will have to end at an age before Age(ei) + (end_period) which is not possible 
given our definition of ei, since we came back L steps within the work period to Age{do) and 
went forward at most 6(synch_time) steps. 

Let us continue applying the Large Progress Lemma from di and extending the path backward 
from do until a cell cq — {xo,vo) with Q < Age(a;o) < 2(5 or vq ~ ai, whichever comes first. 
Look at the cells y which were found by the above-mentioned run of Find_end at some time t to 
have Endo{y, t) = U — I and hence Agei{y, t) < U — I. There are so many of them that some of 
them will be damage-free during the whole work period. Take one such cell y. Rule Marchi keeps 
increasing Agei{y) between times vq and t, hence t — vq < UT*, and 

vi — t < vi — uq < UT* + 5tiL. 

Let us show that Age(co) > 2Q is not possible. Indeed, this would mean vq = <Ji- Then the big 
cell y would go through a switch of a state in ai but our backward path has never changed a 
colony work period. 

, Under the assumptions, the switch in CT2 is legal. 

Proof . Let us follow the development of the colony C(a;), starting from the time vq found 
above. Since the big cell is non- vacant during this, location _Info encodes a string with at most 2 
errors. The rule Compute, begins, at age (compute_start), with an application of Refresh. As the 
corresponding lemma shows, at the end of this, location -Info will still encode the same string with 
possibly fewer errors. Thereafter, the Info track will not change until the next work period, except 
for the local changes caused by the damage and its corrections by new applications of Refresh. 



The following computation can be analyzed similarly to the first part of the proof of Lemma 16.1 
(Refresh). According to the definition of rule Eval, the value put into the part corresponding to 
Memory* of -Hold is obtained without using the result of Retrieve, so it is certainly a value legally 
obtainable from -Info. 

Any subsequent change of -Info or -Hold caused by a damage rectangle will be corrected by a 
later Refresh, and any new inhomogeneity of Doomed will be corrected by Heal. If Doomed = 
1 then the whole colony dies, since the exposed right end is impossible to heal. In the case 
Doomed = 0, the rule Finish installs the changes on the track Jjifo in one step, leaving again no 
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place for more than 2 errors. In both cases, the state at time (T2 will be a legal consequence of the 
state at time a^ — . 

4. Let us lower-bound now the distance between the two switching times of r\* in x. (The proof of 
the upper bound is similar but simpler.) 

The usual analysis of Find-end shows that this rule changes End either in all cells (but possibly 
two) or in none of them (but two). Suppose first that End — U — 1 stays that way during the 
whole work period. Then the end would come only at Age = U — 1. Measuring the time for a cell 
that has never been affected by damage, this would give a lower bound {U — 2Q)piT, > T,*. 

Suppose now that End becomes smaller than U — 1. Consider the first execution of Find_end 
when this happens. Then there must have been some damage-free cell x such that by the time ti 
this execution inspects x it has Agei{x,ti) = U — 1. Let to be the time when Agei{x,to) = at 
the beginning of the work period: then ti —Iq > UT,. From here, the desired lower bound follows 
easily. 

□ 

The lemma below follows easily from the above proof and from Condition |8.5| (Time Marking) . 

Lemma 16.6. Assume that, for some rational number a, Damage* does not interseet {a;} x [a — 
T,*/2+, a -f- 2T**] . If rj* has no switching time during [a+, a + 2T'*] then r/*(a;,a+) — Vac. 

The following lemma infers about the present, not only about the past as the Attribution Lemma. 
For an extended damage rectangle [ao,ai] x [wo,Mi], we call the rectangle 

[ao,ai] X [mo,wo + 4ti] 

its healing wake. 

Lemma 16.7 (Present Attribution). Assume that the live cell cq = [xojto) in colony C with base 
cell zq is not a germ, and is to the left of the center of its originating colony. Assume also that C 
at time to does not intersect the healing wake of any damage rectangle. Then one of the following 
cases holds. 

(1) Cq is a member cell, attributed to C which is covered by a domain of member cells at time to. // 
Q < Age < End — Q then the Info track of this colony has at most 2 errors; 

(2) Co is a member cell from which a path of time projection at most Qti leads back to to a growth 
cell in C and this growth cell can be attributed to C + QB. At time to, ifC + QB does not intersect 
the healing wake of a damage rectangle then [xo,Zo + QB—] is covered by a domain; 

(3) Co is an outer cell, attributed to its originating colony. At time to, if C + QB does not intersect 
the healing wake of a damage rectangle then [xo,Zo + QB—] is covered by a domain; 

(4) Co is a member cell and there is in C a backward path from co, with time projection < 2(split_t) + 
{Q + 1)ti going back to a domain of doomed member cells covering C; 

Proof. 

1. Suppose that co is a member cell and C was covered by member cells at some time in the set 
I \ E {as defined around ( 14. l| )). 



Using the Large Progress lemma repeatedly, we can go back to a time before age (compute_start) 
in C, just as in the proof of the Legality Lemma above. Then we can follow the development of the 
colony forward and see that it forms a continuous domain together with its extension. The Jnfo 



track has at most 2 errors after the first application of Refresh, as seen by Lemma 16.1 (Refresh) 



If the computation results in a nonvacant value for the represented big cell then we have case (|l|) 
of the present lemma. The represented field Creating^ of the colony will be broadcast into the 
field Growing j of its cells, as shown in Lemma 16.4. The homogeneity of this latter field will be 
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maintained by Heal. Thus, depending on the value of Creatingj of the big ceU, growth wiU take 
place and the growth forms a continuous domain with the originating colony. 

Suppose that the computation results in a vacant value. Then Growingj will be everywhere 
but in the healable wake of the damage. Growth cannot start accidentally by a temporary wrong 
value Growing j — 1 in an endcell since there is enough time to correct this value during the long 
waiting time of Grow^step. Also, all cells become doomed. After Age = 1, the doomed right end 
becomes exposed by definition, and the whole colony decays within Qn time units. Before that, 
the colony is full. After that, we have case (|^) of the present lemma. 
, If Co is an outer cell then we have case of the present lemma. 

Proof. The Attribution Lemma attributes cq to the originating colony which is covered by 
member cells during I \ E. It forms a continuous domain with its extension, until the age End. 
From that age on, they form a multidomain. This could only change if the originating colony 
goes through a next work period and kills itself; however, there is not enough time for this due 



to (|13 



, Suppose that cq is a member cell but during I \ E, the colony is never covered by member cells. 
Then we have case (||) of the present lemma. 

Proof. Let ci be a non- member cell in C during I \ E. Then it is an outer cell that can be 
attributed to its originating colony and then the reasoning of ^ above can be repeated. 

□ 
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17. Communication 

17.1. Retrieval rules. Here, the rules for information retrieval from neighbor colonies will be given. 
Neighbors of a cell are of the form dj (x) for j G NbJnd where 

NbJnd = {-1.5,-1,0,1,1.5} 



was defined in Subsection 8.2. Now these same notions of neighbor are applied to big cells and the 



distance unit is QB rather than B. We define Mailjnd and k as in 7.4.2: 

MaiUnd = {-1.1, -0.1, 0.1, 1.1}, 
k — sign(fc) [|fc|J . 

First some definitions will be given, for a cell x and its neighbor y in direction j which is equal to 
"^i.bjix) if the latter is nonvacant and otherwise. For a cell x, let 

(17.1) X* =x- Addr{x)B 

be the base of the originating colony of x and 

X - (Addr(a;) mod Q)B 

be the base of the colony of x. Thus, we have x* — x*'^ iff Addr{x) G [0, Q — 1]. The relation of cells 
and their originating colonies will define the predicate 



EdgeJx) 



'0 ifa;*=y*, 

1 if a; and y are members of two adjacent colonies, 

1.5 ifl<{x* ~y*)/{jQB)< 2, 

oo otherwise. 



The field Mailk has the same subfields as in Subsection 7.4, The subfield Mailk-Status can now also 
have the value 

Fickle 

standing for "transitional". When Mailk-Status = Fickle then the other subfields of Mailk will play 
no role therefore we will, informally, also write Mailk = Fickle. The mail field Mailk of a cell will 
cooperate, in direction j, with mail field 

peer{k, j) ^ k - j [Edge^] 

of a neighbor if the latter index is in MailJnd. To control the track Mailk, there is a one-bit track 

Controlk G {post, get} 

which moves in the direction — sign{fc) (opposite to the direction of mail movement which it controls). 
For |fc| = 0.1, the values of Controlk will simply depend on age: in certain stages of the program the 
mail must be posted, in others it must be passed. For |fc| = 1.1, the value of Controlk will travel in 
the direction — sign(fc). For robustness, a change in the value of Controlk will propagate into the 
inside of the next colony only when it is not to be followed by a change back too soon. By definition, 
for j G { — 1, 1}, either or t?i.5j(x) is vacant. For every field F we denote by 

(17.2) F^^') 

the field F of the nonvacant cell -dnj (x) . Here is the formal propagation of control: 
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rule Propag_control { 

pf or j in { — 1,1} do { 
cond { 

? Edgej > or 

(1) (if 'd2j{'x.) is in the colony of x then Control^\j ~ Control{ 

! Controh i,- := ControP^ n t ■ ■\ 

}}} 

Condition ([^) wiU make sure that a damage rectangle creates at most 1 wrong value of the control 
and that this wrong value practically does not propagate. 

Remark 17.1. In the present discussion, we assume that \Mailk.Info\ = \In{o\, which is not really 
true for a separable transition rule. However, as mentioned in Remark 7.£ , the only modification for 
the case of a narrow mail track is that the Info track must be split up into narrower tracks (packets) 
and each narrower track must be mailed in a separate copy operation. <D 

The rule below shows when and how mail will be posted: 
rule Post_mail { 

for k e {-1.1,1.1} do { 
cond { 

? Controlk — post and Kind £ {Member, Growth} 
! cond { 

? Age ^ [Q, (grow_start)] 
! Mailk Fickle 
?! { 

Maih-Info := Info 
Mailk-Addr := Addr 
}}}}} 

Even if the posting of mail is requested, the posted mail will have value Fickle if the age of the 
sending cell is close to a work period transition of its colony (this will be the case with all growth 
cells). 

Let us define, for j = — sign(fc) and each subfield F of Mailk, 

{Undef if peer{k,j) is undefined, 

j ■ EdgBj if F = Fromnb, \k\ = 0.1, 1 < Edge^ < oo, 

MaiP^ .s.F otherwise. 
pccr{k.j) 

The "hand-shaking" condition 

MaiLused{k) 

says that the cell is free to rewrite Mailk since the value was already used by the neighbor in direction 
j = sign(fc). It is defined to be true if either peer{k,j) or peer{k, — j) is undefined, or the following 
holds for all F: 

{—j ■ EdgCj if F = Fromnb, k — l.lj, 1 < Edge^ < oo, 
Mailk -F otherwise. 

Information on the Mailk tracks moves under the effect of the following rule, except when it is 
overridden by Post^mail. 
rule Move^mail { 

pfor k in MailJnd do { 



Maif^^ ...F : 

pecr{k.j) 
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? Controlk — get and MaiLused{k) 

! Mailk ■= MaiLto^receivek 

}} 

Arriving mail from neighbor m will be stored in location ^Retrievedm on track Portm- Let us call 
by _Retrievedm-Status the status track of the location _Retrievedm- Locations -Retrievedm will be 
checked as to whether the information retrieved in them is transitional (a similar situation arises in 
the proof of Theorem g.lC| (Reach Extension)). The program 

Checko{check_retrieve, Ck_ces, loci, I0C2, ■ ■ ■) 

checks whether for each of the locations that are its arguments, there is a c G {Normal, Undef } such 
that with the possible exception of 3 cells, all symbols are equal to c. The exceptions allow for one 
damage rectangle in each interval of size QB along the way from which information is retrieved. The 



checking rule Checki{) was defined in 16.2, and the program recheckA was defined at (16.4). 
subrule Age_ciiec-k { 

Ci2ecici(ci3ec]c_retrieve, Ci:_res, {-Retrievedm- Status : m G NbJndj); 

Checki{recheckA , Ck_res, Ck_res, 2) ; 

} 

Lemma 17.2 (Age Check). 

(a) Assume that at the beginning of an application of Age.check, for all m, with the possible exception 
of 4 symbols, either all elements of the string in the location _Retrievem-Status are Normal or 
all are Undef. Assume moreover that either the check is damage- free or we can write 3 in place 
of 4 above. Then at the end, with the possible exception of 2 adjacent cells, all cells of the colony 
have Ck_res = 1. 

(b) Assume that at the beginning of Age-check, for some m, it is not true that with the possible 
exception of 4 symbols, either all elements of the string in the location _Retrievem.Status are 
Normal or all are Undef . Then at the end, with the possible exception of 2 adjacent cells, all 
cells of the colony have Ck_res = 0. 



Proof. The proof is similar to the proof of Lemma |16.l| (Refresh) . □ 



The subrule Retrieve is essentially a repetition of another subrule. 
subrule Retrieve { 
Ck_Tes := 
repeat 5 times { 
Ret recycle ; 

idle for X{End — (grow_start) + Q) steps 
}} 

The long idle time between the application allows for getting the neighbor colonies out of a possibly 
transitional situation. The rule Retr^cycle will send the commands post and get on its control 
lines. Command post is repeated just enough times to get it across. Command get is repeated 
enough times for the mail track to transfer the information from any neighbor. During this time, 
the rule will snatch information from each track Mailk to the appropriate track Portm. Then, the 
information will be checked for safety using Age_chcck. If it is safe then the retrieval will not be 
repeated: the rule still goes through the same number of steps but without resetting the receiving 
area and without any snatching, 
rule Retr_cycle { 
cond { 

? Ck_rcs = 
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! f or m e NbJnd \ {0} do { 

Write ( Vacant_str , _Retrievedm )', }}> 
repeat 3QX times { 

(1) for j = -0.1,0.1 do { 

Control j := post }}; 

(2) repeat 2QX times { 

for j = -0.1,0.1 do { 
Control j := get }} 

(3) II cond { 

? Ck_res = 
! for j = -0.1,0.1 do { 
cond { 

? Addr — Mailj.Fromaddr 

! PortMailj.Fromnb ■= Mailj }}}; 

Age-check 
} 

The total number of steps of Retrieve can be estimated as 

ciOA||§fe||/«;fc 

for an appropriate constant ci. 

17.2. Applying the computation rules. For proving part (|) of Condition (Computation 
Property) for big cells, assume that for some rational a, we have 

(17.3) Damage* n[x- 3QB, x + AQB-] x [a - T,* /2+, a + 2T**] = 0. 

By Lemma 16. 6| , if there is no switching time during [a+, a + 2T**\ then 77* (.t, a+) = Vac. 



Lemma 17.3 (Retrieval). Assume that a orai,a2 is defined for rj* . Then we have 



or (g) of Condition S.S (Computation Property). 



Proof. As in the proof of Lemma 16. 5| (Legality), consider a time V2 such that r]*{x,V2) ^ Vac. If 



CTo is defined then V2 = ctq, while if cr2 is defined then it is some time very close from below to CT2. 
Let us define vi as in that proof. As in part ^ of that proof, the Large Progress Lemma can extend 
the path repeatedly backward from vi until we either arrive at either an outer or germ cell or at a 
cell Co = {xq, vq) in the same work period, with Age(co) < 2Q. 

1. Assume that cq is a germ cell. Then we have case (g) of the Computation Property. 

Proof. Repeated application of the Large Progress Lemma gives a time interval long enough 
during which the colony and the adjacent colonies on both sides have been covered by germ cells. 
This implies that the state of the new big cell is latent and also that for the big cell x, we have 
r]*(dj{x, t, rj*), t) — Vac for some time t in [a+, (Tq^] and for j ^ 0. 

2. Now assume that co is a member cell. 

From time on, we can follow the development of colony C. The computation process can be 
treated similarly to the proof in the Legality Lemma; the additional problem is the retrieval of 
the needed information from the neighbor colonies in such a way that all retrieved information 
can be attributed to a single time instant. (This issue was solved once already in the proof of 



Theorem B.IO (Reach Extension)). 



Event (17.3) implies that in the whole space-time area in which this communication takes place 



only the usual damage rectangles must be considered. Due to (13.8), the whole computation fits 
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into an interval of length T,* and therefore in each direction, at most 3 damage rectangles can 
affect the retrieval (considering communication with a non- adjacent neighbor colony). 

The extension arms of the colony will be extended by the rule Extend, defined in Subsec- 
tion 12.4. The success of carrying out the extension over possible latent or germ cells or an 



opposing extension arm in the way is guaranteed by Lemma 12.8 (Creation). If the extension cells 
of C are growth cells then they are stronger but the age check will fail in this case, so we can as- 
sume that this is not the case. If there are extension arms from both colony C and a non-adjacent 
neighbor colony C then the strength ordering gives preference to one side and therefore soon only 
one of the extension arms remains. 

Part (|l|), of rule Retr_cycle sends the message post along the control line Controlo i to the cells 
of the nearest colony C (or its extension) on the left communicating with C. The existence and 
condition of C will be discussed below. This message will be maintained by the healing rule as a 
locally maintained field. Its value arrives on control line Controli i to C (or its extension). There, 
its value is propagated (carefully) by Propag-Control. Via the rule Post_niail, the signal causes 
the cells of C to post the needed information. 

In due time, part (||), of rule Retr.cycle sends the message get along the same control line to 
colonies C and C; this causes the track Mai7i.i of C — QB to pass its information to the right, 
to track Mailo.i of C, which passes it further to the right. Damage can occur during the passing 
phase, but after healing its effect is confined to information (not the control fields) in the cells 
(at most two) which it changed. During the same phase, due to the parallelly running part (^) 
of Retrieve, the Portm fields of each cell of C snatch the information addressed to them from the 
right-moving track MaiJo.i- 



At the end of Retr_cycle, rule Age.check will be applied. By Lemma 17. 2| (Age Check), at the 



end of the application of this rule, all but maybe 2 cells of the track Check_res contain the same 
value. We will say that the process passed or failed the age check. The rule Retr^cycle is iterated 
5 times but it will not affect the Portm tracks after it passes the age check. 

2.1. The age check can fail at most 4 times. 

Proof . For every neighbor colony from which mail is to be retrieved, at most two iterations of 
Retr_cycle can fail the age check. Normally, there can only be one such failure, when the colony 
was found in a transition age: this gives at most 1 failure on the left and 1 failure on the right. 
Due to the idling between iterations of Rctr_cycle, the next iteration will find the colony past 
the transition age. 

It can also happen that the neighbor colony was found in a transition age just before vanishing 
and next time it will be found in a transition age when it is just being recreated by a farther 
colony. However, it is easy to see that this can happen only for one neighbor colony on the left 
and one on the right, bringing the total number of failures to at most 4. 

Take an iteration of Retr.cycle that passes the age check. Let to be the earliest time after the 
beginning of this iteration when the whole area D = [x — 2QB,x + 3QB] is free of the healable 
wake of any damage rectangle, and let ti be the end of this iteration. Let us call any colony C 
with base x' in [a; — 2Q+, x — Q] a, potential left neighbor colony. 

2.2. Suppose that there is a potential left neighbor colony C that stays covered during [to,ti] 
(a temporary hole due to the damage rectangle and healing within 4ti time units does not 
count). 



140 



PETER GACS 



It is easy to see by Lemma 16.7 (Present Attribution) that all cells between C and C (if any) 



not in the wake of a damage rectangle belong to extensions of either C or C. 

The retrieval from C proceeds according to the program. The mail will be passed from C to C 
through the remaining extension arm (if any). The age check guarantees that all the retrieved 
information can be attributed to a single instant of time before ii, since it will not change 
during the at most iQri time units needed for its reading (we count ri for each mail-passing 
step, and add Qti more time units for the possible push-back of extension arm by an opposite 



According to Lemma 16.7 (Present Attribution), before retrieval, each of the strings to be 
retrieved represents some string within 2 deviations. During retrieval, damage can change at 
most 4 cells, increasing the number of deviations to at most to 6. Since we use a 6-error- 
correcting code the computation proceeds from here as expected. 

2.3. Suppose now that there is no potential left neighbor colony that stays covered during [io, ^i]. 



There are the following possibilities, according to the Present Attribution Lemma 16.7 : 

(1) No cells occur in [x — QB, x—] during [io, ^i]; 

(2) Growth to the right from some colony C — QB where C is a potential left neighbor colony; 



(3) A potential left neighbor colony C is beginning to disappear as in part (y) of the Lemma 16.7 
(Present Attribution). Then within 2Qti time units hereafter all cells of C disappear. 

Let us look at the Porti track before the age check, for i S { — 1,-1.5}. If for some cell x on 
it Porti{x) is not the direct result of damage and Porti.Status{x) ^ Undef then this value 
was obtained from a potential neighbor colony C by mail during the last retrieval cycle. In all 



the cases considered, this value is then Fickle. Since the age check passes. Lemma 17.2 (Age 
Check) implies that, with the possible exception of 2 intervals of 2 cells, the cells of C have 
Porti. Status — Undef. Thus at the end of Retrieve, the string on the Porti track represents a 
vacant value within 4 errors. 

2.3.1. There is a time interval of length at least 3Qti before ti in which all big cells corresponding 
to potential left neighbor colonies C are vacant and in which therefore the information 
collected by Retrieve can be attributed to any instant. 

Proof . In case (Q), the big cell corresponding to colony C is indeed empty all the time. In 
case (H), the growth process from colony C — QB could not have started long before to since 
so few cells reported unsafe age: therefore the duration of growth provides the necessary long 
interval. In case (||), cells of C start vanishing within Q steps after tg. Otherwise namely the 
Porti track would snatch values with Status = Fickle and then would not pass the age check. 
Once the vanishing of colony C started, it will be over within Qti time units. Rcpopulating 
it by growth will take sufficiently long that it does not create a new cell during this retrieval 
cycle. 

, Assume that cq is an outer cell. Then we have case (|) of the Computation Property. 
Proof. The fact that the new big cell is latent can be similarly concluded. The Attribution 
Lemma allows us to attribute the outer cell in question to a neighbor colony, say with base cell 
X — QB on the left, from which it has grown. This implies Growingi — 1 for this colony, which, 
as seen above, implies Creatingi = 1 in the big cell x — QB. 
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Now the reasoning of |^ above can be applied to this big ceU, seeing that it computes its next 
state according to the transition rule from information retrieved from its neighbors. In particular, 
it applies the rule Arbitrate and therefore can only have Creatingi = 1 if the big cell x is vacant 
when observed. 

Therefore, on the other hand, the creation of the latent big cell x can be attributed to the rule 
Create, hence the transition rule was also satisfied in big cell x. 

□ 

The following lemma covers the cases left after the above lemma. 



Lemma 17.4 (Growth). A ssum e that we have ( 17. 3| ) and ri*{x,t) = Vac for t e [a+,a + 2T 



Then part (|h|) of Condition p.<q (Computation Property) applies to rj* 

Proof. Let C be the colony whose base is x. If there is any {y,t) with < |y — a;| < QB, t G 
[a+,a + 2T'*] and r]*{y,t) ^ Vac then we are done; assume therefore that there is no such y. If 
there is any t in [a+,a + 2T'*] when both r)*{x — QB,t) and r]*{x + QB,t) are vacant then at this 
time there is no potential creator, and we are done. The case remains when for all t in [a+, a + 2T**] 
there is a j S {—1,1}, with r]*{x + jQB,t). Creating _j = 1 and there is no big cell y during this 
time whose body intersects the potential body of the l3ig cell x. We will show that this case does 
not occur. Without loss of generality, suppose 

r]*{x - QB, a + 2T'* - T,* / 2). Creating^ = 1. 

1. Suppose that there is no t in [a+, a + 2T** - T,*/2] with r]*{x + QB, t).Creating_^ = 1. 

Then for all t in this interval, we have rj*{x — QB, t). Creating^ = 1. Tracing backward and forward 
the evolution of the colony of the big cell x — QB (and applying Lemmas 16. 5| (Legality) and 17.3 



(Retrieval), we will find a whole work period [ui, U2] of colony C — QB in [a — T,*/2, a + 2T'* — 
r,*/2]. At the end of this work period, growth to the right takes place. 

If the growth succeeds this would contradict our assumption: suppose that does not. This 



can only happen, according to Lemma 12.8 (Creation), if some non-germ cell z is in the way. 



Lemma 16.7 (Present Attribution), shows that z is a left extension cell of a live big cell y in 



[x + QB+, X + 2QB—], and is therefore not stronger than the right growth it is keeping up. 

The rule Arbitrate kills z since it prefers right growth to the left one, and therefore z does not 
really prevent the right growth from succeeding. Since this preference is arbitrary assume that 
the rule Arbitrate actually prefers growth in the left direction and that z is a left growth cell. We 
can trace backward and forward the evolution of the colony of y to see that it carries its growth 
to conclusion, resulting in a new big cell y — QB before time a + 2T'* , whose body intersects the 
body of X. This has also been excluded. 
2. Suppose that there is also a t in [a+, a + 2T'* — T',*/2] with rj*{x + QB, t).Creating_i — 1. 
Tracing backward and forward the evolution of the colony of the big cells x — QB and x + QB 
(and applying the Legality and Retrieval lemmas) i t is e asy to see that they are non-vacant for 



all t in [a+, a + 2T'*] (since according to Condition 12.6 , Creating ^ = 1 implies Dying — 0) 



It is easy to see that there isaj G { — 1,1} and [ii, t2] C [a — T,*/2,a + 2T'*] such that [ti,t2] is 
a dwell period of big cell x + jQB with Creating _j = 1. Assume first that j = — 1 can be chosen. 



In the colony C of a; between the colonies C — QB and C + QB, due to Lemma 16.7 (Present 
Attribution), all non-germ cells belong to the extension of one of these two colonies. The growth 
rule of a; — QB will try to create a latent big cell in C. If the growth rule of C + QB is not active 
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then there are no obstacles to this creation and it succeeds, contrary to the original assumption: 
so we are done. Moreover, it succeeds even if the growth rule of C + QB is active since the strength 
relations favor growth from the left. 

Assume now that only j — 1 can be chosen. The reasoning is similar to the above except when 
the growth rule of C — QB is active at the same time when C + QB is trying to grow left. This can 
only happen when the dwell period [^i, of big cell x + QB overlaps with a dwell period [ui, 1*2] 
of big cell X — QB with Creating^ — 1. By our assumption, [ui, U2] <^ [a — T,*/2, a + 2T'*]. 

In the next reasoning, we will use the fact that growth is confined to a time interval of length 
at most r,*/2 at the end of any colony work period. 

Assume t2 > a + 2T'* — T,* /2. Then we must have it2 > a + 2T** . Hence the previous 
dwell period [uo,ui] oi x — QB is in [a+,a + 2T'*] and must have Creatingi = 0. Therefore 
Creating_i = 1 in the previous dwell period [to, ti] of cc + QB. Due to the confinement of growth 
to the end of the work period, the growth to the left from C + QB at the end of the period [to, ti] 
will be undisturbed by growth to the right from C — QB and the creation succeeds near time ti, 
contrary to the original assumption. 

Assume now t2 < a + 2T'* — r,*/2. Then due to the confinement of growth just mentioned, 
we have U2 <a + 2T** . Hence, ui < a — r,*/2, and the next dwell period of x — QB, contained 
entirely in [a+, a + 2T**], must have Creatingi = 0. Then the next dwell period of x + QB is still 
entirely in [a+, a + 2T**] and has Creating_^ — 1, and the creation to the left succeeds. 

□ 



17.3. The error parameters. Lemma 9.4 (Amplifier) says that given a (broadcast) amplifier frame 
Frame with large enough Rq, we can complete it to a uniform (broadcast) amplifier. We leave out 
the case of broadcast amplifiers: the proof comes with some simplification from the proof for general 
amplifiers. 

The main ingredient of our amplifier is the sequence of transition functions Sim^tranSk and the 
sequence of codes tpk*, fk**, fl: wh ich w as defined in the course of the proof. It remains to verify 



the amplifier properties listed after (3.21). Let us recall them here. 



(a) (Rider'') with guard fields {Guard'') is a guarded shared field for (ipk*), {^k**), as defined in 



Subsection 4.2; 

(b) the damage map of the simulation <l>fe is defined as in Subsection ^T| ; 

(c) Trk is combined from Rd-transk and Sim_transk , with fields Rider , Guard ; 

(d) Mk^Rob{Trk,Bk,T,k,T'k,ek,s'^); 

(e) $fe has e'^'-error-correction for tpk**'. 

The first three properties follow immediately from the definition of Trk and the code, given in the 
preceding sections. It has also been shown, by induction, that Mk+i is a robust medium simulated 
by Mk via the code, with Ek as the error bound and T.^,, T*k as the work period bounds. It needs 
to be shown yet that indeed serves as the bound in the definition of Mk = Roh{- ■ ■), and that 
the simulation has e'^'-error-correction. 

s'k+i must bound the difference from 0.5 of the probability of the new coin-toss of the simulated 
computation, and the simulation in the work period must be shown to have ej^'-error-correction. 
We must show that in a big cell transition, the probability that the Rand* field is 1 is in [0.5 — 
e5j._i_]^, 0.5 + Efc+i]. (We also have to show that the bounds on the probabilities are of the form of 
sums and products as required in the definition of canonical simulation, but this is automatic.) 
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In case there is no damage rectangle during the whole work period, the field Rand* was computed 
with the help of the rule Randomize. This rule took the value X found in field Rand of the base 
cell of the colony and copied it into the location holding Rand* . 

Remark 17.5. This is the point where we use that our simulation is a canonical simulation but not 
necessarily a deterministic one: the time at which the the base cell tosses the coin can be found 
using a stopping time within the colony work period. <0 

By the property of Mk, the probabihty that X = 1/2 is within of 0.5. By its definition, e'^ 
upper-bounds the probability that any damage rectangle intersects the colony work period. Therefore 
the probability that Rand* ^ X can be bounded by Sk- Hence, the probability that the Rand* field 
is 1 is in 

[0.5 -4 -4,0.5 + 4 + 4'] = [0.5-4+1,0.5 + 4+1]. 

As just noted, with probability ej^'+i, no damage rectangle occurs during a colony work period. 
Under such condition, the rule Refresh works without a hitch and each cell contains the information 
encoded by the code from the state of the big cell. This shows that our simulation has the 
e^'-error-correction property. 



144 



PETER GACS 



18. Germs 

18.1. Control. Recall the definition of r(/,d) in ( |10.2D . 

Lemma 18.1. Let xq be a site and ti < t2 times. Assume that (xq, ti) is controlled and r(a;o, SB) x 
[ti — 8r*,t2] is damage-free. Then (0:0,^2) is controlled. 

Proof. In the proof below, controlling always means controlling (ccq, ti). Let to ~ ti — 6T'. For t in 

[^0,^2], let 

A{t) ^T{xo,5B) X [to^t]. 



. Let t £ [ii,t2]- Assume that xq is controlled for all t' < t. Then [xa — 3-6, xo + 3B—] x {t} 
contains a cell. 

Proof. By our assumption, for all t' < t, each set [xq — B,xo+ B—] x [t' — 6T*, t'] contains a cell 
Xi. If it stays until time t then we are done. If it disappears the death must have been caused by 
Arbitrate, due to a cell yi with \yi — xi \ < 2B about to create another cell whose body intersects 
with the body of xi. Cell yi had Dying = to be able to kill and therefore survives until time t. 
. Let t G [ti,t2]- Assume that Xq is controlled for all t' < t. Then T{xi,B) x [t ~ 6T',t] contains 
a cell. 

Proof . Bu assumption, for all t' < t, each set r(a;o, B) x [t' — QT*, t'] contains a cell xi. Without 
loss of generality, suppose xi < xq. If this is also true for t' ^ t then we are done. Suppose it 
does not: then one such cell xi disappears at time t — 6T*. This must have been caused by the 
rule Arbitrate, due to some cell yi about to create an adjacent neighbor whose body intersects 
with the body of xi. Without loss of generality, assume yi S [xi — 2B+, xi — B—]. According to 
the Arbitrate, cell yi must have had Dying = 0, Creating^ = 1 to be able to erase and therefore 



survives until time t. Since according to Condition 12.6 , only the rule Arbitrate changes Creating^ 



and only when a right neighbor has appeared, cell j/i keeps trying to create a right neighbor. 

2.1. Suppose that yi succeeds in creating yi + B before time to + 2T'. 

liyi+B > Xq — B then it will control; otherwise, yi + B will try to create yi + 2B. If it succeeds 
in 2T* time units then the created cell will control; if it does not then then according to the 
Computation Property, another cell in [yi + 2B+,yi + 3B—] interferes and it will control. 

2.2. Suppose that the creation does not succeed within 2T' time units. 

Then a cell X2 G [yi + B+, yi + 2B—] interferes. If 0:2 > xq — B then X2 will control, suppose 
therefore that X2 < xo — B. We have — X2I < B since xi,X2 G [yi + B,yi + 2B]. Therefore 
X2 must have arisen after xi disappeared. Due to part |l| above, X2 could not have arisen by 
spontaneous birth, hence it must have been created by a right neighbor. If X2 was created after 
to then its creating right neighbor is alive after and it will control. Suppose therefore that 
X2 was created at time to and its right neighbor disappeared by time to. 

Suppose that X2 stays for at least 2T' time units; then it tries to create X2 + B. If it succeeds 
then X2 + B will control; if it does not then the cell that interferes in the creation will control. 

Suppose that X2 disappears before to + 2T' . Suppose that yi now succeeds in creating yi + B 
before time to + AT' . li yi + B > xo — B then it will control; otherwise, it will try to create 
2/1 + 2B. If it succeeds in 2T* time units then the created cell will control. If it does not then 
the cell that interferes will control. 
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Suppose that yi still does not succeed in its creation. Then another cell X3 appears before 
to + 4r* that prevents this. This 2:3 must have been created by a right neighbor that will 
control. 

□ 

Lemma 18.2. The media of our amplifier have the lasting control property. 
Proof. Assume that for some sites xq and times ti < t2, with 

I = Tixo,{it2-t,)T'/T,+A)B), 
I X {ti} is blue and I x [ti — 2T*,t2] is damage-free. We have to prove that {xQ,t2) is blue. 



Lemma 18.1 implies that this point is controlled; what remains to show is that the controlling 
cell in r(xo, B) x [^2 — 6T*, t2] is blue. Due to part Q above, in the area considered, there is always 
some cell within distance 3B left or right of any site, and this prevents any germ cell from being 
born. If therefore the controlling cell would be non-blue then one could construct from it a path of 
non-blue cells backwards. This path cannot reach outside / during [ii, t2] since it needs at least T, 
time units to move one cell width away from xq. □ 

Let us introduce an undirected graph G{ri) on space-time points of a space-time configuration 77 
by defining below the following kinds of edge. 

(1) points {x,t) and {x,t') are connected for t < t' if a cell of 77 is present in x for all of [t,t']; 

(2) suppose that |a; — ?/| < 2B and there is a cell at time t in both x and y, and moreover there is a 
cell in y during the a whole dwell period of x containing t. Then (x, t) and {y, t) are connected. 

(3) if a cell has just been created in {x,t) by an adjacent cell y whose last dwell period before t 
begins in t' then {x,t) and (y,t') are connected; 

These are all the edges of graph G. 

Lemma 18.3. Let to = <i - 8T', 

/ = r(xo,9S). 

Assume that I x {to} is controlled and the area / x [to — ST*, ti] is damage-free. Then there are sites 
zi < X{) < Z2 and a path in the graph G defined on I x [to,ti], connecting (zi,ti) with (z2,ti). 

Proof. Since (xo, to) is controlled there is a cell (xi, ui) in r(a;o, B) x [to — 6T*, to]. Without loss of 

generality, assume that xi >xq. 

1 . Suppose that xi stays a cell until time ti . 

Since {xi — B,to) is controlled there is a cell (0:2, ^2) inT{xi — B, B) x [to — 6T', to]. If the cell in 
X2 exists until time ti then necessarily X2 < xq, and it is easy to see that there is a time Vi such 
that the points (a:i,ti), (x2,Wi), (2:2, ti) form a path in G. 

1.1. Suppose that the cell in X2 lasts until to + 2T*. 

Then it coexists with xi at time to, therefore X2 G [2^1 — 2B+,xi — B]. Since {x2 — B,to) is 
controlled there is a cell {x^jUs) in r(a;2 — B,B) x [to — 6r*,to]. If the cell in 0:3 exists until 
time ti then 2:3 < a;2 — B, and it is easy to see that there are times Vi such that the points 
(a;i,ti), {xi,vi), {xi,V2), (2:2, W2) (2^2,^3), {x3,V3), (2:3, ti) form a path in G. 

If the cell in 2:3 disappears before time ti then this has been caused by Arbitrate, via a cell 
yi about to create another cell intersecting with the body of X3. As seen in similar discussions 
above, cell yi exists during the whole of [to — 8r*,ti]. Therefore it is not equal to X2 which 
is assumed to disappear. But its body does not intersect the body of of either 2:2 or 2:3 hence 
2/1 G [X3-2B+,X3- B-]. 
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If cell X3 lasts until to + 2T' then it is easy to see that there are times Vi such that the points 
(xi,ti), (xi,i;i), {xi,V2), {x2,V2) ix2,V3), (x^^vs), (2:3, U4) (2/1, 1^4), (2/1, ii) form a path in G. 

Suppose that X3 disappears before to + 2T'. Now yi tries to create yi+B which (if this succeeds) 
tries to create yi + 2B, etc. until reaching xi. If we reach xi this defines a path again between 
(xi,ti) and (j/i,ti). 

Suppose that one of these creations, e.g. the creation of yi + B does not succeed. Then a cell 
exists in 

[yi + B+, yi + 2B~] x [to + 2T', h + iT']. 

The space is not sufficient for this cell to be born spontaneously, therefore it can be traced back 
via creations to one of xi,X2,X3 or to some cell existing during [to,io + 2T*] between yi and 
X2- This way again a path can be defined. 

1.2. Suppose that the cell in X2 disappears before to + 2T*. 

This has been caused by Arbitrate, via a cell yi about to create another cell intersecting with 
the body of X2; as above, cell yi exists during the whole of [to — 8T*,ti]. Then either yi — xi 
or yi G [x2 — 2B-\-, X2 — B—\. In the second case, there are points vi such that (xi, ii), (xi, vi), 
{xi,V2), {x2,V2) {x2,V3), {yi^vs), yi,ti) form a path in G. Suppose therefore j/i = xi. 

After X2 dies, either yi succeeds creating yi — B before time to + 4r' or some cell (a;2,U2) 

yi — B. In both cases, we can continue as 



in part 1.1 above with x'2 in place of X2 and to + 5T* in place of to + 2T' . 
. Suppose that the cell in Xi disappears before time ti. 

This has been caused by Arbitrate, via a cell yi about to create another cell intersecting with the 
body of xi; again, yi exists through all the time considered here. If yi > xi then we can continue 
as in part |^ above with yi in the role of Xi above and xi in the role of X2 above. If yi < xi then 
we can do the same, after a left-right reflection. 

□ 



Lemma 18.4. The simulations of our amplifier have the control delegation property. 

Proof. Let M = M^, ^ = for some k, for our amplifier, and let 77 be a trajectory of M. Let 
T*=T\, T'* =T\+i, etc. Let 

I ^T{xo,{ST'*/T,* +4)QB). 

Assume that I x {ti ~ 8T'2} is blue in 77* — ^*{ri) and / x [<i — 16r*2,ti] is damage-free in 77. We 
must prove that {xo,ti) is blue in 77. 

As mentioned at the beginning of Section |ll|, the simulation (p^ consists of two parts: the simula- 
tion of Mk+i by a medium of reach 2, and the simulation of by Mk as shown in Theorem 8.1C| . 



Control delegation from to Mk is trivial, since the cells of the two media are essentially the same; 
hence, we need to consider only control delegation from Mk+i to M^. 



Lemma 18.3 implies that there are sites zi < < Z2 and a path in the graph G{rf) defined on 
/ X [toj^i], connecting {zi,ti) with (z2,ti). This path consists of blue big cells: indeed, since the 
area is controlled in 77* birth is excluded, therefore a non-blue big cell would have to trace back its 
origin to outside /, but the complement of / is too far to do this in the available time. The path 
gives rise in a natural way to a sequence (ci, . . . ,c„) of small cells with ci — (zi,ti), c„ = (z2,<i), 
and for each i, either Ci and Ci+i are in two consecutive dwell periods of the same cell, or have the 
form {x,t), {y,t) with |a; — y| < 2B. All these cells are member or extension cells of the colonies of 
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the big cells of the path. Since Color is a broadcast field and Guard = — f in all our cells, these 
small cells are all blue. Let us form a polygon Pi connecting these points with straight lines. 

Let A be the set of elements of / x [ti — IQT'2, ti] reachable from {xq, ti) along any polygon at 
all without crossing Pi. It is easy to see that A is controlled; let us show that it is also blue. Indeed, 



according to Lemma 18.1, not birth can take place in a controlled area, therefore from any cell in A 
a path of ancestors passes to the outside of A. When it crosses Pi one of its cells must be equal to 
a blue cell and therefore the whole path must be blue. □ 

18.2. The program of a germ. We now add some rules to the program of our amplifier to make 
it self-organizing. A germ starts out as a single cell with Age — and Addr = —Q and tries to grow 
into an area covering 3 colonies by growing right, but can be held up by some other cells. The cell 
with address —Q will be called the leader of the germ (calling it the "base" would be confusing). 
Germ cells are weaker than non-germ cells, hence a growing germ cannot intrude into an extended 
colony and a growing colony can destroy germ cells in its way. In what follows, the germ growth 
rule is based on work periods just as colonies. Not all germ work periods are alike: each work period 
is associated with a level. Let 

(18.1) I = log(3g) 

(not necessarily an integer). The level of the germ is stored in a field 

Level e [0, 1-] U 00 

of each of its cells. Cells stronger than germ cells have level 00. Latent cells are level germ cells; 
thus, even latent cells have color. Age will be reset to at the beginning of each work period, and 
the work period of level s lasts until Age = U (s) where 

U{s) = 4plA2^ 

We say that a work period has level s, if the leader cell has that level. Typically, in this case 
the germ has already been extended to a size at least 2*_B. Therefore if a right edge cell has 
Addr + Q < 2^°'^°' — 1 then it will be called exposed. This extends the definition of exposed edges 



given in Subsection |12.l| without affecting any previous reasoning using this notion. There is also a 
field 

Active e {0, 1}. 

Cells with Active = 1 are called active, others passive. Here are the rules using these properties: 

1. If a germ cell sees a (not necessarily adjacent) germ cell on the left with the same color but 
higher level then it becomes vacant. 

2. When a passive germ cell of level s sees an active germ cell on the left, of the same color and 
level, with Age < U{s) — 2pi2'*, then it turns vacant. 

The work period is divided into two parts: growth and computation. In growth, the right end 
attempts to grow to size 2^*+^. The computation part is omitted if the right end has address 2Q — 1 
(the final goal). This part, which takes the 2**+^ last steps of the work period, checks whether the 
germ has size 2^+^i?, by propagating left the information about the address of the right edge in a 
field NewJevel. liNewJevel > Level then in the next work period, each cell sets Level := NewJevel. 
Otherwise, the leader cell chooses a random number in {0, 1} and broadcasts it right on the track 
Active. This ends the computation. 

Remark 18.5. Probably a much simpler rule (e.g. a random decision each time there is a conflict) 
would do but our rule makes the proof very simple. On the other hand, the estimates of the proof 
would be much better with a somewhat more complicated rule that allows for some error-correction 
in the germ cells, e.g. erasing a small germ surrounded by cells of different color. 
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18.3. Proof of self-organization. 

Lemma 18.6 (Germ Attribution). Let ci = {xi,ti) he a blue germ cell, of level s. Assume that the 
area 

[xo - 3 • xo + X [ti - 2 ■ 2^1, ti + 2U{s)X] 

is free of damage and of nonblue cells. Then either ci is part of a domain with no exposed edge or 
there is a blue cell of higher level in the above area. 

Proof. Let us construct a path Pi backward from ci in such a way that we keep the path on the 
left edge of the germ: thus, whenever possible we move left on a horizontal link, otherwise we move 
back in time on a vertical link. At time 

to = ti ~ 2^Ti, 
the path might stop at a cell cq one level lower than Ci. 

1. There is a time t^ in [toi^i] when Piit^) is a leader and the germ has size > 2*. 

Proof. Suppose this is not so. If the right end is exposed at time to then it remains so and the 
germ decays away in the given time. If the left end is exposed during all of [to, to + (split_time)] 
then the left end decays, becomes unhealable, and the germ will decay. Suppose therefore that 
both ends are protected by some time t'^ < to + (split_time) . If the right end has level s at this 
time, then it has the desired size, too. 

Suppose the germ at the time t^ has size < and the right end has a lower level allowing 
this size. If the left end is younger than the computation start then the computation would never 
allow it to switch to a higher level with this small size, so the size 2'^B must be reached at some 
time ^3. If it is older than the computation start then (unless an end becomes exposed and the 
germs dies on that account) the age advances quickly to the new work period, and the whole germ 
obtains the new, higher level. If the germ is still small its right becomes exposed and then the 
whole germ will be wiped out. 
Let us follow the development of the germ from t^ forward. If the left end survives then the germ 
survives till time ti and we are done. If the left end will be destroyed by a cell C2 with higher level 
then we are done. 

The last possibility is that the left end will be destroyed by an active cell C4 on the same level. Let 
us construct a path P2 backward from C4 in a way similar to Pi . By the reasoning of part |] above 
we arrive at a time ^5 at which ^2(^5) is a leader and the germ has size > 2^. 

Following the development of this (active) germ forward, its left end either survives or will be killed 
by a cell C2 of level > s (in which case we are done, having found C2). If the left end survives then 
the germ of C4 kills the germ of ci, growing over at least 2* killed cells. (It is easy to see that new 
germ cells arising in place of the killed ones do not have sufficient time to reach level s and are 
therefore no obstacle to this growth.) This creates a germ of size > 2"+^ whose level will increase 
after its next computation notices this. All this happens within at most 2 work periods of the germ 
of C4. □ 
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Lemma 18.7 (Level Increase). For a level s > 0, integers n > 1, r > 3 ■ 2''_B, some site xq, some 
rational time to, with cq = (xo,to)i 

di = di{s) = 3C/(s)pi, 

/o -r(Lo,3gs), 

Go ~ [to ~ 2 ■ 2'*ri,to], 
Gi = [to — 3T', to], 
Hi = [to,to + ndiT']. 

Assume 

(18.2) ndiT'<T,*. 

Let Qo be the event that lo x (Gq U Hi) is damage-free, lo x Gi is blue, and cq has level s. Let 
TLi be the event that Lo x (Gi U Hi), is blue and contains a cell of level s + 1. Then there is a 
constant ki > such that the probability of —Hi fl Qo is at most e^^^". Moreover, this estimate is 
also understood in the sense of an injective canonical simulation. 

Proof. 

1. All cells in Lo x Hi are blue. 

Proof . Non-blue germ cells cannot arise in a blue area. If a non-blue non-germ cell would occur 
it could be attributed to a full non-blue colony. It takes two full consecutive colony-occupations 
to re ach in to Lo from outside Lo, and this takes at least T,* time units which is, by the assump- 
tion ( 18.2 ), more than our total time. 
Lemma 18.6 shows that, in the absence of damage, either the conclusion of the present lemma holds 
or Co is part of a germ of size at least 2^*, without exposed edges. Let ci — {xi,to) be the left endccU 
of this germ. 

2. The following holds during 

Ji = [io,io + diT'] : 

either a cell with level > s occurs in r(a;i, 2^^^B) or a leader cell C2 = {x2, ^2) with level s occurs 
in [xi + 2'^B,xi + 2'''+^£?— ]. In the latter case, at time t2, all cells between xi and X2 belong to 
the germ of ci , with no place left for cells between them. 

Proof . If ci is killed during [to, to + 2U{s)ti\ from the left by a higher-level cell then we ar e done. 



Suppose that it will be killed by an active cell C2 of the same level. Then Lemma 18.6 (Germ 
Attribution) implies that C2 is contained in a germ without exposed edges, hence of size > 2^. 
This germ conquers the area of the germ of ci within the next 2U{s)ti time units, and creates a 
cell of level s + 1, which is one of the desired outcomes. 

Suppose that this does not happen. If the growth of the germ of ci will not be held up then 
it will create a cell of level s + 1 within 2U (s)ti time units. It can be held up only by a cell of 
higher or equal level. A cell of higher level would give what is required: suppose that this does 
not happen. If it is held up by a cell of the same level, then unless this cell is a leader it decays 
within time ti. Therefore the growth succeeds unless a leader cell C2 of the same level is in the 
way and all cells between xo and xi belong to the germ of ci, with no place left for cells between 
them. 

3. Let ^3=^2+ diT*, J2 = [t2,t'i\. Let ^1(^2) be the event that Level{xi,t2) = s, there is a leader 
cell C2 — {x2,t2) of level s in [xi + 2^3, xi + 2'*+^i3— ], and at time t2 all cells between xi and 
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X2 belong to the germ of ci, with no place left for another cell. The probability that Qq n Ji(t2) 
holds and no cell of level > s occurs during J2 in r(a;i, 2^^^B) is at most 



l-(0.5-e')^- 

Proof . If xi will be killed from the left during J2 then we will be done by the same reasoning as 
in part |^ above. Suppose that this does not happen. 

Suppose that that X2 is the later one among xi and X2 to make a work period transition after 
t2 , at time V2 ■ Let W3 be its next work period transition. Let U2 be the last work period transition 
time of Xi before V2 and 1x3,^4 the next work period transitions of xi. Let ^72(^2) be the event 
that the coin tosses at time M2,it3 make xi active and the coin toss at time V2 makes X2 passive. 

Suppose now that xi is the later one to make its first transition after t2, at time U3. Let M4 be 
its next work period transition. Let V3 be the last work period transition of X2 before M3 and V4 
its next work period transition. Let 1/2(^2 ) be the event that the coin tosses at time W3,f4 make 
X2 passive and the coin toss at time U3 makes xi active. 

The randomization part of the Computation Property implies that, in the absence of damage, 
the probability of ^1(^2) H ^^2(^2) in both cases is at most 1 — (0.5 — e')'^. Let us show that in 
both cases if 1/1(^2) H ^2(^2) holds then Level{xi, U4) = s + 1. i.e. the germ of xi will overrun the 
germ of X2 while it is passive. 

In the first case, suppose that the work period [1)2, W2] of X2 covers the work period [7/3, U4] of 
xi. Then xi certainly has sufficient time to overrun. If [?;2,i^2] does not covers [1*3,^4] then it is 
divided between [7/2,^3] and [7/3, U4], therefore xi will have sufficient time in one of these work 
periods for the overrun. The reasoning is analogous in the other case. 
Repeated application of the above reasoning gives the probability upper bound (1 — (0.5 — e')^)"^-^. 
Indeed, we reason about n disjoint windows (separated by the times t2,t3,t4, . . . where t^+i = 
ti + diT*). If J2{t2) fails then Ji(tz) still holds in the window between ^3 and ^4, so the reasoning 
is applicable to this new window, etc. The probability bounds will multiply due to the disjointness 
of the windows. □ 



Recall the definition of I in (18.1). 



Lemma 18.8 (Birth). For integer n> \, and cq = (xojto), let 



d2 


= 120p?AQ, 


lo 


= r(a;o,33QB), 


Li 


= r(a;o,30QB), 


Go 


= [to - Qn, ^o], 


Gi 


= [^0 — 37"*, fo], 


Hi 


= [to, to + nd2T* 



Let he the event that Iq x (Go U Hi) is damage-free, and Iq x Gi is blue. Let Tii be the event that 
Li X (Go U Hi) contains a big blue cell. 
Assume 

(18.3) nd2T'<T,*. 

There is a constant Ki > such that the probability of —Hi H t/o *s most le^'^^^^ . Moreover, this 
bound is also understood in the sense of an infective canonical simulation. 
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Proof. Repeated application of Lemma 18.7 (Level Increase). Notice that 



E 

0<s<i 



di{s) = 3pi 



U{s) ^ l2plX V 2" < 24p^A2' < 120p^AQ 



0<s<l 



0<s<l 



For level s > 0, let 

=r(xo,6-2^B), 
ts = to + 24np?AT'2". 

Then tg — ts-i + ndi{s)T' for s > 0. Let Tig be the event that a cell of level s appears in Lg before 
time ts- The blueness of Iq implies Hq. Repeated application of Lemma 18.7 (Level Increase) shows 
that the probability of -^Hs H Hi for i < s is at most e"''^" for an appropriate constant ki. From 
here, we obtain the required upper bound on the probability of non-birth. □ 



Proof of Lemma 10. Let 77 be a trajectory of a medium Mk of our amplifier and 77* — $^(77). 
We will use the notation Q = Q^, etc. Let us define some quantities with the absolute constants 
Co, Ci, C2, Ki that will be defined later. 



Oi 

92 



CiB, 
CiQB, 



1)£, 



93 



(24CiQ + 3C2t/ 

e-''i^/^Ci/33. 
a' = qi+q2+ qs- 

Our goal is to show that for every time wq, if rj i s (-Di, (T)-blue at time vq, then rj* is {D2, cr')-blue at 
time vq + Oi. Then we will be done since ( 10^ ) implies cr' < ak+i- 

As mentioned at the beginning of Section |ll|, the simulation t/?^ consists of two parts: the simulation 
of Mfc+i by a medium of reach 2, and the simulation of by Mf^ as shown in Theorem 8.1C . 
1. We will concentrate on the self-organization from M'^. to M^+i. 

The self-organization from Mk to Af^ is similar but much simpler, so we omit it (it refers to the 

proof of Theorem 8.10| ) . 
We will use the inequalities 

T'*/T,* < 3, 



(18.4) 
(18.5) 



T'*/T, < 2U, 



where ( 18.4 ) is the same as ( p.7| ) and ( 18.5 ) follows from (9^). Let vq be a time, and let J2 be a 
space interval of length D2 ■ Let 

d2 = UOplXQ, 

J0=T{J2,D2), 

4^T{J2,D2~B), 
Go = [vq - Qti,vo], 
Gi = [uo - 5r*,i;o], 
Ho = [vq,vo + Oi], 
Hi = [vo,vo +T,*]. 
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Assume that is {Di, a)-h\\ie at time vq. Let £i be the event that there is no damage in Jq x {GqUHq) 
The Restoration Property gives the upper bound 

(18.6) ^3^Ch+Qn^ ^ (24^^^ ^ 3^^^ ^ 



on the probabiUty of where we used ( 18.4 ), ( 18. 5 ) 



Let £2 be the event that there is a subinterval Ji of Jo of size 2.5-Di such that (Jo \ Ji) x Gi is blue. 
2. The probabihty of £1 D -'£2 is upper-bounded by 



(18.7) {D2/B)^al < 8{C,Qfal 



Proof . The proof is a Uttle similar to the proof of Lemma 8.4, only simpler. If 



{iDi/2+[0,Di-])nJo X Gi 

is blue for all i G Z then clearly Jq x Gi is blue. For some i,j with |i — j| > 4, let !Fij be the 
event that this set is not blue, for either i or j. The blueness of r] at time vq implies the upper 
bound on the probability of J^ij. The probability that there is an i,j such that Tij holds is 
at most 9>{D2 / a"^ . If there is no such pair i, j then it is easy to see that £2 holds. 
Under condition of £1 Ci £2^ let Ji be the subinterval introduced in the definition of £2 ■ Let 

/o,, = T{66QBi,S3QB). 

Let £3 be the event that for all i suc h tha t /o,i C Jq \ Ji the set /o,i x (Go U iJi) contains a big blue 
cell. With n < T.*/{d2T'), Lemma |l8j (Birth) gives the upper bound 



log(5Q)e-"^" — 



33QB' 



for an appropriate constant K2, on the probability of £1 n £2 H ->£3. Using (18.4), (18.5) and (9.16) 
we can compute a constant ki for upperbounding the above by 

(18.8) e-'"i^/'5Gi/33. 

We will show that under condition £1 f) £2^) £3, the trajectory 77* is blue over 

Ao^J2-x [vo + Oi-5T.*,vo + Oi]. 



3. Assume that Ji is disjoint from T{J2, D2/2). 
3.1. T{J2,2QB) x iJo contains no non-blue cell. 

Proof. Assume that, on the contrary, it contains such a cell. Let us build a path of ancestors 



from this cell. This path cannot end in a birth inside T{J2, D2/2 — 2B) since Lemma 18.1 
(Lasting Control) implies that no birth takes place there. Let us show that the path will not 
leave this set either. We will show that it does not leave on the right. For this, whenever we 
have a choice between father and mother, we build the path to the left. This way, the path 
only moves to the right when the cell is obtained from its mother by growth or end-healing. 
Once the backward path moved into the originating colony of a growth it will stay there at 
least until the beginning of the work period before it can move right again. It needs therefore 
at least (i — 1)T,* time units to move over i colonies, and its total displacement will be at most 
QB{l + Oi/T,*). For this to be smaller than the distance of the complement of r(J2, D2/2 — 2B) 
from r( J2, 2QB), we need 

QB{1 + Oi/T,*) < D2/2 - 2QB - 2B. 
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Rearranging and using y»*/x'^* < 3 from (9.7) show that this is satisfied if 



(18.9) 2(4 + 3C2) < Ci. 

Therefore the path never leaves r( J2, 132/2 — 2B): this is impossible since the path is not blue 
but the set it is in is blue at time vq- 

Let a; e J2: we will show that x becomes controlled in 77* at some time before uq + Oi — 5T,*. 



Then Lemma 18.1 will imply that it stays controlled until vq + Oi and part 3.1 above shows that 



it is actually blue. 

Let i be such that x £ /o.i. Event £3 implies that /o,i(Go U Hi) contains a big blue cell {y,t). 
This cell is within distance Q&QB from x, say, to the left. As soon as it arises it begins to create a 
right neighbor, which also creates a right neighbor, etc., until the chain either reaches x or will be 
prevented by another blue cell z within distance QB on the right. In the latter case, we continue 
from z, and see that the chain controls x by time t + (SIT**: hence we are done if 

C2 > 73. 

4. Assume that Ji intersects r(J2,Z32/2). 
4.1. There are no non-germ non-blue cells in 

r(j2,2gs) X [vn + QTi,vn + Oi]. 

Proof. The size of Ji is 2.5-Di < QB according to our assumption about Q. Therefore if it has 
any non-blue non-germ cells at time vq then these cells are in domai ns w ith an exposed edge 
and will decay within Qti time units. It can be seen just as in part |3.l| above that non-blue 
non-germ edges don't have time to grow in from outside r(J2, 02). 

Non-blue germ cells in Ji may also begin to grow to the right. But within 5 cell widths, they 



meet an existing blue cell x, as seen in part hi of the proof of Lemma 18.1 (Lasting Control) 



X will not be erased by a non-blue germ cell on the left. It will not be erased by a germ cell 
on the right either since germ cells erase other germ cells only in the right direction. It might 
be erased by a non-germ cell y on the right, that is about to create a left adjacent neighbor to 
itself. Cell y (or, even the neighbor y — B it creates) will then be the next obstacle and it can 
only be eliminated in a similar way. The time between these successive eliminations (except 
possibly between the first two) is at least T,* since for the next occurrence of such an event, we 
must wait for the colony in question to die and for a new colony to overtake and start growing 
a left extension. Therefore the total number of cells that can be added to the non-blue germ in 
this exotic way is at most Oi/T,* + 2 < 3C2 + 2 (using x'*T,* < 3 again), which still leaves it 
a germ if 

(18.10) Co > 3C2 + 2. 

Let a; £ J2 : we will show as in ^ above that x becomes controlled in rj* at some time before 
vo + Oi-dT,*. 

Let lo^i be closest possible to x while disjoint from that Ji. Event £3 implies that /o,i(Go UiJi) 
contains a big blue cell {y,t). This cell is within distance 

(1.25) -01 + 2- QQQB = 1.25CiB -I- 132QB 

from X, say, to the left. As soon as it arises it begins to create a right neighbor, which also creates 
a right neighbor, etc., until the chain either reaches x or will be prevented by another blue cell z 
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within distance QB on the right. In the latter case, we continue from z, and see that the chain 
controls x by time t + 133T**: hence we are done if 

C2 > 139. 

We have shown that the probability that 77* is not blue over Aq is at most a'. We bounded it, 
referring only to events within a window with space projection Jq. It follows that for a group of m 
disjoint space translations of this window, the probability bounds that r]* is not blue over any of the 
corresponding translations of ^0, is bounded by ct"". □ 
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19. Some applications and open problems 

19.1. Non-periodic Gibbs states. Consider spin systems in the usual sense (generalizations of 
the Ising model). All 2-dimensional spin systems hitherto known were known to have only a finite 
number of extremal Gibbs states (see e.g. thus, theoretically, the amount of storable information 
on an n X n square lattice did not grow with the size of the lattice. In 3 dimensions this is not true 
anymore, since we can stack independent 2-dimensional planes: thus, in a cube C„ of size n, we can 
store n bits of information. More precisely, the information content of C„ can be measured by the 
dimension of the set of vectors 

i^J.{a{x) = 1} : X G C„) 

where fi runs through the set of Gibbs states. This dimension can be at most 0{n'^~^) in a d- 
dimensional lattice, since the Gibbs state on a cube is determined by the distribution on its boundary. 
The stacking construction shows that storing U{n'^~^) bits of information is easy. We can show that 
^l{n'^~^) is achievable: in particular, it is possible to store an infinite sequence in a 2-dimensional spin 
system in such a way that n bits of it are recoverable from any n sites with different x coordinates. 

For this, we apply a transformation from (see also |^): a probabilistic cellular automaton 
M in d dimensions gives rise to an equilibrium system M' in (d + 1) dimensions. Essentially, the 
logarithms of the local transition probabilities define the function J and space-time configurations 
of M become the space-configurations of M'. Non-ergodicity of M corresponds to phase transition 



in M' . In the cellular automaton of Theorem 5.6, each infinite sequence g gives rise to a space-time 



configuration storing the bits of g in consecutive cells. Now, each of these space-time configurations 
gives rise to a separate Gibbs state belonging to one and the same potential. 

Let us note that though the Gibbs system is defined in terms of an energy function H{a), it is 
not helpful to represent this energy function in terms of a temperature T as H{a)/T. The reason is 
that the individual terms of -ff (cr) do not depend linearly on the error probability e (or any function 
of it). In fact, we believe it can be proved that if an artificial T is introduced (with T = 1 for a 
certain sufficiently small value of e) then a slight decrease of T destroys the phase transition. 

19.2. Some open problems. 

19.2.1. Turing machines. It is an interesting question (asked by Manuel Blum) whether a reliable 
Turing machine can be built if the tape is left undisturbed, only the internal state is subject to 
faults. A construction similar to reliable cellular automaton seems to be possible. It is not known 
whether there is a simpler construction or, whether there is a way to derive such a machine from 
reliable cellular automata. 

19.2.2. Relaxation time as a function of space size. Consider now Toom's medium as a typical ex- 
ample of a medium non-ergodic for m = oo. It follows from Toom's proof that, for small enough 
fault probability e, we have limm->oo Tm(fi, 1/3) = oo, i.e. the increase of space increases the relax- 
ation time (the length of time for which the rule keeps information) unboundedly. The speed of this 
increase is interesting since it shows the durability of information as a function of the size of the 
cellular automaton in which it is stored. Toom's original proof gives only rm(0, 1/3) > cm for some 
constant c. The proof in [||, improving on [|4|, gives (0,1/3) > e'^™ for some constant c, and 
this is essentially the meaning of saying that Toom's rule helps remember a bit of information for 
exponential time. 

So far, the relaxation times of all known nontrivial non-ergodic media (besides Toom's, the ones 
in ||l^ and ||ll|) depend exponentially on the size of the space. It is an interesting question whether 
this is necessary. In a later work, we hope to show that this is not the case and that there are 
non-ergodic media such that r„i{n, 1/3) < m'^ holds for all n, for some constant c. The main idea is 
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that since the medium will be able to perform an arbitrary reliable computation this computation 
may involve recognizing the finiteness of the space rather early (in time m'^) and then erasing all 
information. 



19.2.3. Relaxation time as a function of observed area. Lemma 5.5 seems to suggest that the issue 
of information loss in a cellular automaton is solved by the question of mixing for m — oo. This is 
not so, however: as noted together with Larry Gray, Leonid Levin and Kati Marton, we must also 
take the dependence of rm(n, S) on n into account. As time increases we may be willing to use more 
and more cells to retrieve the original information. Even if raoin,6) < oo for each m, we may be 
satisfied with the information-keeping capability of the medium if, say, r.m{n, 1.9) > e*^" for some 
constant c. 

In some mixing systems, the dependence of rm(n, S) on n is known. 

Example 1 9.1 ( The contact process.). The contact process is a one-dimensional CCA as defined in 
Subsection ^.4 Let us note that this this process is not noisy: not all local transition rates are 
positive. The process has states 0,1. In trajectory ri{x,t), state r]{x,t) = 1 turns into with rate 1. 
State r](x, t) = turns into 1 with rate X{ri{x ^ l,t) + rj{x -\- 1, t)). It is known that this process has 
a critical rate Ac G [0-I-, oo— ] with the following properties. 

If A < Ac then the process is mixing with the invariant measure concentrated on the configuration 
^ with ^{x) = for all x. If A > Ac then the process is non-ergodic. 

If A < Ac then it is known (see Theorem 3.4 in Chapter VI of |Q) that the order of magnitude 
of roo(^^, <5) is logn. 

If A = Ac then the convergence is much slower, with an order of magnitude that is a power of n 
(see Theorem 3.10 in Chapter VI of |§ and §). <> 

We believe it possible to construct a medium that is mixing for to = oo but loses information 
arbitrary slowly: for any computable function /(rt), and a constant c there is a medium with 

r,„(n, 1.9) >/(n) A 

for finite or infinite to. Notice that this includes functions f{n) like e*^ . Such a result could 
be viewed as an argument against the relevance of the non-ergodicity of the infinite medium for 
practical information conservation in a finite medium. The construction could be based on the 
ability to perform arbitrary computation reliably and therefore also to destroy locally identifiable 
information arbitrarily slowly. 
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